Computational 

Geometry 

Algorithms  and  Applications 
Third  Edition 


4^  Springer 


Computational  Geometry 


Third  Edition 


Mark  de  Berg  •  Otfried  Cheong 
Marc  van  Kreveld  •  Mark  Overmars 


Computational  Geometry 

Algorithms  and  Applications 
Third  Edition 


4^  Springer 


Dr.  Marc  van  Kreveld 


Prof.  Dr.  Mark  de  Berg 

Department  of  Mathematics 
and  Computer  Science 
TU  Eindhoven 
P.O.  Box  513 
5600  MB  Eindhoven 
The  Netherlands 

mdberg  @  win .  tue  .nl 


Department  of  Information 
and  Computing  Sciences 
Utrecht  University 
P.O.  Box  80.089 
3508  TB  Utrecht 
The  Netherlands 

marc@cs.uu.nl 


Dr.  Otfried  Cheong,  ne  Schwarzkopf 
Department  of  Computer  Science 
KAIST 

Gwahangno  335,  Yuseong-gu 

Daejeon  305-701 

Korea 

otfried@kaist.edu 


Prof.  Dr.  Mark  Overmars 
Department  of  Information 
and  Computing  Sciences 
Utrecht  University 
P.O.  Box  80.089 
3508  TB  Utrecht 
The  Netherlands 
markov@cs.uu.nl 


ISBN  978-3-540-77973-5  e-ISBN  978-3-540-77974-2 

DOI  10.1007/978-3-540-77974-2 

ACM  Computing  Classification  (1998):  F.2.2, 1.3.5 

Library  of  Congress  Control  Number:  2008921564 

©  2008,  2000,  1997  Springer-Verlag  Berlin  Heidelberg 

This  work  is  subject  to  copyright.  All  rights  are  reserved,  whether  the  whole  or  part  of  the  material 
is  concerned,  specifically  the  rights  of  translation,  reprinting,  reuse  of  illustrations,  recitation, 
broadcasting,  reproduction  on  microfilm  or  in  any  other  way,  and  storage  in  data  banks.  Duplication 
of  this  publication  or  parts  thereof  is  permitted  only  under  the  provisions  of  the  German  Copyright 
Law  of  September  9,  1965,  in  its  current  version,  and  permission  for  use  must  always  be  obtained 
from  Springer.  Violations  are  liable  for  prosecution  under  the  German  Copyright  Law. 

The  use  of  general  descriptive  names,  registered  names,  trademarks,  etc.  in  this  publication  does 
not  imply,  even  in  the  absence  of  a  specific  statement,  that  such  names  are  exempt  from  the  relevant 
protective  laws  and  regulations  and  therefore  free  for  general  use. 

Cover  design:  KiinkelLopka,  Heidelberg 

Printed  on  acid-free  paper 

987654321 


sprmger.com 


Preface 


Computational  geometry  emerged  from  the  field  of  algorithms  design  and 
analysis  in  the  late  1970s.  It  has  grown  into  a  recognized  discipline  with  its 
own  journals,  conferences,  and  a  large  community  of  active  researchers.  The 
success  of  the  field  as  a  research  discipline  can  on  the  one  hand  be  explained 
from  the  beauty  of  the  problems  studied  and  the  solutions  obtained,  and,  on  the 
other  hand,  by  the  many  application  domains — computer  graphics,  geographic 
information  systems  (GIS),  robotics,  and  others — in  which  geometric  algorithms 
play  a  fundamental  role. 

For  many  geometric  problems  the  early  algorithmic  solutions  were  either 
slow  or  difficult  to  understand  and  implement.  In  recent  years  a  number  of  new 
algorithmic  techniques  have  been  developed  that  improved  and  simplified  many 
of  the  previous  approaches.  In  this  textbook  we  have  tried  to  make  these  modern 
algorithmic  solutions  accessible  to  a  large  audience.  The  book  has  been  written 
as  a  textbook  for  a  course  in  computational  geometry,  but  it  can  also  be  used  for 
self-study. 


Structure  of  the  book.  Each  of  the  sixteen  chapters  (except  the  introductory 
chapter)  starts  with  a  problem  arising  in  one  of  the  application  domains.  This 
problem  is  then  transformed  into  a  purely  geometric  one,  which  is  solved 
using  techniques  from  computational  geometry.  The  geometric  problem  and  the 
concepts  and  techniques  needed  to  solve  it  are  the  real  topic  of  each  chapter.  The 
choice  of  the  applications  was  guided  by  the  topics  in  computational  geometry 
we  wanted  to  cover;  they  are  not  meant  to  provide  a  good  coverage  of  the 
application  domains.  The  purpose  of  the  applications  is  to  motivate  the  reader; 
the  goal  of  the  chapters  is  not  to  provide  ready-to-use  solutions  for  them.  Having 
said  this,  we  believe  that  knowledge  of  computational  geometry  is  important 
to  solve  geometric  problems  in  application  areas  efficiently.  We  hope  that  our 
book  will  not  only  raise  the  interest  of  people  from  the  algorithms  community, 
but  also  from  people  in  the  application  areas. 

For  most  geometric  problems  treated  we  give  just  one  solution,  even  when 
a  number  of  different  solutions  exist.  In  general  we  have  chosen  the  solution 
that  is  easiest  to  understand  and  implement.  This  is  not  necessarily  the  most 
efficient  solution.  We  also  took  care  that  the  book  contains  a  good  mixture  of 
techniques  like  divide-and-conquer,  plane  sweep,  and  randomized  algorithms. 

We  decided  not  to  treat  all  sorts  of  variations  to  the  problems;  we  felt  it  is  more 
important  to  introduce  all  main  topics  in  computational  geometry  than  to  give 

more  detailed  information  about  a  smaller  number  of  topics.  v 


vi 


Preface  Several  chapters  contain  one  or  more  sections  marked  with  a  star.  They  con¬ 

tain  improvements  of  the  solution,  extensions,  or  explain  the  relation  between 
various  problems.  They  are  not  essential  for  understanding  the  remainder  of  the 
book. 

Every  chapter  concludes  with  a  section  that  is  entitled  Notes  and  Comments. 
These  sections  indicate  where  the  results  described  in  the  chapter  originated, 
mention  other  solutions,  generalizations,  and  improvements,  and  provide  refer¬ 
ences.  They  can  be  skipped,  but  do  contain  useful  material  for  those  who  want 
to  know  more  about  the  topic  of  the  chapter. 

At  the  end  of  each  chapter  a  number  of  exercises  is  provided.  These  range 
from  easy  tests  to  check  whether  the  reader  understands  the  material  to  more 
elaborate  questions  that  extend  the  material  covered.  Difficult  exercises  and 
exercises  about  starred  sections  are  indicated  with  a  star. 

A  course  outline.  Even  though  the  chapters  in  this  book  are  largely  indepen¬ 
dent,  they  should  preferably  not  be  treated  in  an  arbitrary  order.  For  instance. 
Chapter  2  introduces  plane  sweep  algorithms,  and  it  is  best  to  read  this  chapter 
before  any  of  the  other  chapters  that  use  this  technique.  Similarly,  Chapter  4 
should  be  read  before  any  other  chapter  that  uses  randomized  algorithms. 

For  a  first  course  on  computational  geometry,  we  advise  treating  Chapters  1- 
10  in  the  given  order.  They  cover  the  concepts  and  techniques  that,  according 
to  us,  should  be  present  in  any  course  on  computational  geometry.  When  more 
material  can  be  covered,  a  selection  can  be  made  from  the  remaining  chapters. 

Prerequisites.  The  book  can  be  used  as  a  textbook  for  a  high-level  under¬ 
graduate  course  or  a  low-level  graduate  course,  depending  on  the  rest  of  the 
curriculum.  Readers  are  assumed  to  have  a  basic  knowledge  of  the  design  and 
analysis  of  algorithms  and  data  structures:  they  should  be  familiar  with  big-Oh 
notations  and  simple  algorithmic  techniques  like  sorting,  binary  search,  and 
balanced  search  trees.  No  knowledge  of  the  application  domains  is  required,  and 
hardly  any  knowledge  of  geometry.  The  analysis  of  the  randomized  algorithms 
uses  some  very  elementary  probability  theory. 

Implementations.  The  algorithms  in  this  book  are  presented  in  a  pseudo¬ 
code  that,  although  rather  high-level,  is  detailed  enough  to  make  it  relatively 
easy  to  implement  them.  In  particular  we  have  tried  to  indicate  how  to  handle 
degenerate  cases,  which  are  often  a  source  of  frustration  when  it  comes  to 
implementing. 

We  believe  that  it  is  very  useful  to  implement  one  or  more  of  the  algorithms; 
it  will  give  a  feeling  for  the  complexity  of  the  algorithms  in  practice.  Each 
chapter  can  be  seen  as  a  programming  project.  Depending  on  the  amount  of 
time  available  one  can  either  just  implement  the  plain  geometric  algorithms,  or 
implement  the  application  as  well. 

To  implement  a  geometric  algorithm  a  number  of  basic  data  types — points, 
lines,  polygons,  and  so  on — and  basic  routines  that  operate  on  them  are  needed. 
Implementing  these  basic  routines  in  a  robust  manner  is  not  easy,  and  takes  a  lot 


of  time.  Although  it  is  good  to  do  this  at  least  once,  it  is  useful  to  have  a  software  Preface 
library  available  that  contains  the  basic  data  types  and  routines.  Pointers  to  such 
libraries  can  be  found  on  our  Web  site. 

Web  site.  This  book  is  accompanied  by  a  Web  site,  which  contains  a  list  of 
errata  collected  for  each  edition  of  the  book,  all  figures  and  the  pseudo  code  for 
all  algorithms,  as  well  as  some  other  resources.  The  address  is 

http : / /www . cs . uu . nl/ geobook/ 

You  can  also  use  the  address  given  on  our  Web  site  to  send  us  errors  you 
have  found,  or  any  other  comments  you  have  about  the  book. 

About  the  third  edition.  This  third  edition  contains  two  major  additions:  In 
Chapter  7,  on  Voronoi  diagrams,  we  now  also  discuss  Voronoi  diagrams  of  line 
segments  and  farthest-point  Voronoi  diagrams.  In  Chapter  12,  we  have  included 
an  extra  section  on  binary  space  partition  trees  for  low-density  scenes,  as  an 
introduction  to  realistic  input  models.  In  addition,  a  large  number  of  small  and 
some  larger  errors  have  been  corrected  (see  the  list  of  errata  for  the  second 
edition  on  the  Web  site).  We  have  also  updated  the  notes  and  comments  of  every 
chapter  to  include  references  to  recent  results  and  recent  literature.  We  have 
tried  not  to  change  the  numbering  of  sections  and  exercises,  so  that  it  should  be 
possible  for  students  in  a  course  to  still  use  the  second  edition. 

Acknowledgments.  Writing  a  textbook  is  a  long  process,  even  with  four 
authors.  Many  people  contributed  to  the  original  first  edition  by  providing 
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suggesting  changes,  and  by  finding  and  correcting  errors.  Many  more  provided 
feedback  and  found  errors  in  the  first  two  editions.  We  would  like  to  thank  all  of 
them,  in  particular  Pankaj  Agarwal,  Helmut  Alt,  Marshall  Bern,  Jit  Bose,  Hazel 
Everett,  Gerald  Farin,  Steve  Fortune,  Geert-Jan  Giezeman,  Mordecai  Golin,  Dan 
Halperin,  Richard  Karp,  Matthew  Katz,  Klara  Kedem,  Nelson  Max,  Joseph  S.  B. 
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1  Computational  Geometry 


Introduction 


Imagine  you  are  walking  on  the  campus  of  a  university  and  suddenly  you  realize 
you  have  to  make  an  urgent  phone  call.  There  are  many  public  phones  on 
campus  and  of  course  you  want  to  go  to  the  nearest  one.  But  which  one  is  the 
nearest?  It  would  be  helpful  to  have  a  map  on  which  you  could  look  up  the 
nearest  public  phone,  wherever  on  campus  you  are.  The  map  should  show  a 
subdivision  of  the  campus  into  regions,  and  for  each  region  indicate  the  nearest 
public  phone.  What  would  these  regions  look  like?  And  how  could  we  compute 
them? 

Even  though  this  is  not  such  a  terribly  important  issue,  it  describes  the  basics 
of  a  fundamental  geometric  concept,  which  plays  a  role  in  many  applications. 
The  subdivision  of  the  campus  is  a  so-called  Voronoi  diagram,  and  it  will  be 
studied  in  Chapter  7  in  this  book.  It  can  be  used  to  model  trading  areas  of 
different  cities,  to  guide  robots,  and  even  to  describe  and  simulate  the  growth 
of  crystals.  Computing  a  geometric  structure  like  a  Voronoi  diagram  requires 
geometric  algorithms.  Such  algorithms  form  the  topic  of  this  book. 

A  second  example.  Assume  you  located  the  closest  public  phone.  With 
a  campus  map  in  hand  you  will  probably  have  little  problem  in  getting  to  the 
phone  along  a  reasonably  short  path,  without  hitting  walls  and  other  objects. 
But  programming  a  robot  to  perform  the  same  task  is  a  lot  more  difficult.  Again, 
the  heart  of  the  problem  is  geometric:  given  a  collection  of  geometric  obstacles, 
we  have  to  find  a  short  connection  between  two  points,  avoiding  collisions  with 
the  obstacles.  Solving  this  so-called  motion  planning  problem  is  of  crucial 
importance  in  robotics.  Chapters  13  and  15  deal  with  geometric  algorithms 
required  for  motion  planning. 

A  third  example.  Assume  you  don’t  have  one  map  but  two:  one  with 
a  description  of  the  various  buildings,  including  the  public  phones,  and  one 
indicating  the  roads  on  the  campus.  To  plan  a  motion  to  the  public  phone  we 
have  to  overlay  these  maps,  that  is,  we  have  to  combine  the  information  in 
the  two  maps.  Overlaying  maps  is  one  of  the  basic  operations  of  geographic 
information  systems.  It  involves  locating  the  position  of  objects  from  one  map 
in  the  other,  computing  the  intersection  of  various  features,  and  so  on.  Chapter  2 
deals  with  this  problem. 
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COMPUTATIONAL  GEOMETRY 


convex  not  convex 
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These  are  just  three  examples  of  geometric  problems  requiring  carefully  de¬ 
signed  geometric  algorithms  for  their  solution.  In  the  1970s  the  field  of  compu¬ 
tational  geometry  emerged,  dealing  with  such  geometric  problems.  It  can  be 
defined  as  the  systematic  study  of  algorithms  and  data  structures  for  geometric 
objects,  with  a  focus  on  exact  algorithms  that  are  asymptotically  fast.  Many 
researchers  were  attracted  by  the  challenges  posed  by  the  geometric  problems. 
The  road  from  problem  formulation  to  efficient  and  elegant  solutions  has  often 
been  long,  with  many  difficult  and  sub-optimal  intermediate  results.  Today  there 
is  a  rich  collection  of  geometric  algorithms  that  are  efficient,  and  relatively  easy 
to  understand  and  implement. 

This  book  describes  the  most  important  notions,  techniques,  algorithms, 
and  data  structures  from  computational  geometry  in  a  way  that  we  hope  will  be 
attractive  to  readers  who  are  interested  in  applying  results  from  computational 
geometry.  Each  chapter  is  motivated  with  a  real  computational  problem  that 
requires  geometric  algorithms  for  its  solution.  To  show  the  wide  applicability 
of  computational  geometry,  the  problems  were  taken  from  various  application 
areas:  robotics,  computer  graphics,  CAD/CAM,  and  geographic  information 
systems. 

You  should  not  expect  ready-to-implement  software  solutions  for  major 
problems  in  the  application  areas.  Every  chapter  deals  with  a  single  concept  in 
computational  geometry;  the  applications  only  serve  to  introduce  and  motivate 
the  concepts.  They  also  illustrate  the  process  of  modeling  an  engineering 
problem  and  finding  an  exact  solution. 


1.1  An  Example:  Convex  Hulls 

Good  solutions  to  algorithmic  problems  of  a  geometric  nature  are  mostly  based 
on  two  ingredients.  One  is  a  thorough  understanding  of  the  geometric  properties 
of  the  problem,  the  other  is  a  proper  application  of  algorithmic  techniques  and 
data  structures.  If  you  don’t  understand  the  geometry  of  the  problem,  all  the 
algorithms  of  the  world  won’t  help  you  to  solve  it  efficiently.  On  the  other  hand, 
even  if  you  perfectly  understand  the  geometry  of  the  problem,  it  is  hard  to  solve 
it  effectively  if  you  don’t  know  the  right  algorithmic  techniques.  This  book  will 
give  you  a  thorough  understanding  of  the  most  important  geometric  concepts 
and  algorithmic  techniques. 

To  illustrate  the  issues  that  arise  in  developing  a  geometric  algorithm,  this 
section  deals  with  one  of  the  first  problems  that  was  studied  in  computational 
geometry:  the  computation  of  planar  convex  hulls.  We’ll  skip  the  motivation 
for  this  problem  here;  if  you  are  interested  you  can  read  the  introduction  to 
Chapter  11,  where  we  study  convex  hulls  in  3-dimensional  space. 

A  subset  S  of  the  plane  is  called  convex  if  and  only  if  for  any  pair  of  points 
p,q  £  S  the  line  segment  pq  is  completely  contained  in  5.  The  convex  hull 
G!K(S)  of  a  set  S  is  the  smallest  convex  set  that  contains  S.  To  be  more  precise, 
it  is  the  intersection  of  all  convex  sets  that  contain  5. 


We  will  study  the  problem  of  computing  the  convex  hull  of  a  finite  set  P 
of  n  points  in  the  plane.  We  can  visualize  what  the  convex  hull  looks  like  by  a 
thought  experiment.  Imagine  that  the  points  are  nails  sticking  out  of  the  plane, 
take  an  elastic  rubber  band,  hold  it  around  the  nails,  and  let  it  go.  It  will  snap 
around  the  nails,  minimizing  its  length.  The  area  enclosed  by  the  rubber  band 
is  the  convex  hull  of  P.  This  leads  to  an  alternative  definition  of  the  convex 
hull  of  a  finite  set  P  of  points  in  the  plane:  it  is  the  unique  convex  polygon 
whose  vertices  are  points  from  P  and  that  contains  all  points  of  P.  Of  course 
we  should  prove  rigorously  that  this  is  well  defined — that  is,  that  the  polygon  is 
unique — and  that  the  definition  is  equivalent  to  the  one  given  earlier,  but  let’s 
skip  that  in  this  introductory  chapter. 

How  do  we  compute  the  convex  hull?  Before  we  can  answer  this  question  we 
must  ask  another  question:  what  does  it  mean  to  compute  the  convex  hull? 
As  we  have  seen,  the  convex  hull  of  P  is  a  convex  polygon.  A  natural  way 
to  represent  a  polygon  is  by  listing  its  vertices  in  clockwise  order,  starting 
with  an  arbitrary  one.  So  the  problem  we  want  to  solve  is  this:  given  a  set 
P  =  {p\ ,p2,  ■  . .  ,p„}  of  points  in  the  plane,  compute  a  list  that  contains  those 
points  from  P  that  are  the  vertices  of  G TC(P),  listed  in  clockwise  order. 

P9 

input  =  set  of  points: 

Pl,P2,P3,P4,P5,P6,Pl,PS,P9 
output  =  representation  of  the  convex  hull: 

Pi,P5,PS,P2,P9 

PS 

The  first  definition  of  convex  hulls  is  of  little  help  when  we  want  to  design 
an  algorithm  to  compute  the  convex  hull.  It  talks  about  the  intersection  of  all 
convex  sets  containing  P,  of  which  there  are  infinitely  many.  The  observation 
that  G'K(P)  is  a  convex  polygon  is  more  useful.  Let’s  see  what  the  edges  of 
G3~C(P)  are.  Both  endpoints  p  and  q  of  such  an  edge  are  points  of  P,  and  if  we 
direct  the  line  through  p  and  q  such  that  GK(P)  lies  to  the  right,  then  all  the 
points  of  P  must  lie  to  the  right  of  this  line.  The  reverse  is  also  true:  if  all  points 
of  P\{p,q}  lie  to  the  right  of  the  directed  line  through  p  and  q ,  then  ~pq  is  an 
edge  of  GK (P). 

Now  that  we  understand  the  geometry  of  the  problem  a  little  bit  better  we  can 
develop  an  algorithm.  We  will  describe  it  in  a  style  of  pseudocode  we  will  use 
throughout  this  book. 

Algorithm  SlowConvexHull(P) 

Input.  A  set  P  of  points  in  the  plane. 

Output.  A  list  L  containing  the  vertices  of  G.'K(P)  in  clockwise  order. 

1.  £<-0. 

2.  for  all  ordered  pairs  (p,  q)  £  P  x  P  with  p  not  equal  to  q 

3.  do  valid  <—  true 
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Figure  1.1 

Computing  a  convex  hull 
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destination  of  e\ 
=  origin  of  e^. 


origin  of  e\ 


4.  for  all  points  r  £  P  not  equal  to  p  or  q 

5.  do  if  r  lies  to  the  left  of  the  directed  line  from  p  to  q 

6.  then  valid  <—  false. 

7.  if  valid  then  Add  the  directed  edge  pq  to  E. 

8.  From  the  set  E  of  edges  construct  a  list  L  of  vertices  of  ll'.K(P),  sorted  in 
clockwise  order. 

Two  steps  in  the  algorithm  are  perhaps  not  entirely  clear. 

The  first  one  is  line  5:  how  do  we  test  whether  a  point  lies  to  the  left  or  to  the 
right  of  a  directed  line?  This  is  one  of  the  primitive  operations  required  in  most 
geometric  algorithms.  Throughout  this  book  we  assume  that  such  operations 
are  available.  It  is  clear  that  they  can  be  performed  in  constant  time  so  the 
actual  implementation  will  not  affect  the  asymptotic  running  time  in  order  of 
magnitude.  This  is  not  to  say  that  such  primitive  operations  are  unimportant  or 
trivial.  They  are  not  easy  to  implement  correctly  and  their  implementation  will 
affect  the  actual  running  time  of  the  algorithm.  Fortunately,  software  libraries 
containing  such  primitive  operations  are  nowadays  available.  We  conclude  that 
we  don’t  have  to  worry  about  the  test  in  line  5;  we  may  assume  that  we  have  a 
function  available  performing  the  test  for  us  in  constant  time. 

The  other  step  of  the  algorithm  that  requires  some  explanation  is  the  last  one. 
In  the  loop  of  lines  2-7  we  determine  the  set  E  of  convex  hull  edges.  From  E  we 
can  construct  the  list  L  as  follows.  The  edges  in  E  are  directed,  so  we  can  speak 
about  the  origin  and  the  destination  of  an  edge.  Because  the  edges  are  directed 
such  that  the  other  points  lie  to  their  right,  the  destination  of  an  edge  comes 
after  its  origin  when  the  vertices  are  listed  in  clockwise  order.  Now  remove 
an  arbitrary  edge  ej  from  E.  Put  the  origin  of  e\  as  the  first  point  into  £,  and 
the  destination  as  the  second  point.  Find  the  edge  e2  in  E  whose  origin  is  the 
destination  of  e\,  remove  it  from  E ,  and  append  its  destination  to  L.  Next,  find 
the  edge  ej,  whose  origin  is  the  destination  of  ej,  remove  it  from  E,  and  append 
its  destination  to  L.  We  continue  in  this  manner  until  there  is  only  one  edge  left 
in  E.  Then  we  are  done;  the  destination  of  the  remaining  edge  is  necessarily  the 
origin  of  e\,  which  is  already  the  first  point  in  E.  A  simple  implementation  of 
this  procedure  takes  0{rr)  time.  This  can  easily  be  improved  to  O(nlogn),  but 
the  time  required  for  the  rest  of  the  algorithm  dominates  the  total  running  time 
anyway. 

Analyzing  the  time  complexity  of  SlowConvexHull  is  easy.  We  check 
n2  —  n  pairs  of  points.  For  each  pair  we  look  at  the  n  —  2  other  points  to  see 
whether  they  all  lie  on  the  right  side.  This  will  take  0(rti)  time  in  total.  The 
final  step  takes  0(n2)  time,  so  the  total  running  time  is  <9(n3).  An  algorithm 
with  a  cubic  running  time  is  too  slow  to  be  of  practical  use  for  anything  but  the 
smallest  input  sets.  The  problem  is  that  we  did  not  use  any  clever  algorithmic 
design  techniques;  we  just  translated  the  geometric  insight  into  an  algorithm  in 
a  brute-force  manner.  But  before  we  try  to  do  better,  it  is  useful  to  make  several 
observations  about  this  algorithm. 

We  have  been  a  bit  careless  when  deriving  the  criterion  of  when  a  pair  p,q 
defines  an  edge  of  CK(P).  A  point  r  does  not  always  lie  to  the  right  or  to  the 


left  of  the  line  through  p  and  q,  it  can  also  happen  that  it  lies  on  this  line.  What 
should  we  do  then?  This  is  what  we  call  a  degenerate  case,  or  a  degeneracy  for 
short.  We  prefer  to  ignore  such  situations  when  we  first  think  about  a  problem, 
so  that  we  don’t  get  confused  when  we  try  to  figure  out  the  geometric  properties 
of  a  problem.  However,  these  situations  do  arise  in  practice.  For  instance,  if 
we  create  the  points  by  clicking  on  a  screen  with  a  mouse,  all  points  will  have 
small  integer  coordinates,  and  it  is  quite  likely  that  we  will  create  three  points 
on  a  line. 

To  make  the  algorithm  correct  in  the  presence  of  degeneracies  we  must 
reformulate  the  criterion  above  as  follows:  a  directed  edge  pq  is  an  edge  of 
G,‘K(P)  if  and  only  if  all  other  points  r  €  P  lie  either  strictly  to  the  right  of  the 
directed  line  through  p  and  q,  or  they  lie  on  the  open  line  segment  ~pq.  (We 
assume  that  there  are  no  coinciding  points  in  P.)  So  we  have  to  replace  line  5  of 
the  algorithm  by  this  more  complicated  test. 
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We  have  been  ignoring  another  important  issue  that  can  influence  the  correctness 
of  the  result  of  our  algorithm.  We  implicitly  assumed  that  we  can  somehow 
test  exactly  whether  a  point  lies  to  the  right  or  to  the  left  of  a  given  line.  This 
is  not  necessarily  true:  if  the  points  are  given  in  floating  point  coordinates  and 
the  computations  are  done  using  floating  point  arithmetic,  then  there  will  be 
rounding  errors  that  may  distort  the  outcome  of  tests. 

Imagine  that  there  are  three  points  p,  q,  and  r,  that  are  nearly  collinear,  and 
that  all  other  points  lie  far  to  the  right  of  them.  Our  algorithm  tests  the  pairs 
( p,q ),  ( r,q ),  and  (p,r).  Since  these  points  are  nearly  collinear,  it  is  possible  that 
the  rounding  errors  lead  us  to  decide  that  r  lies  to  the  right  of  the  line  from  p  to 
q,  that  p  lies  to  the  right  of  the  line  from  r  to  q,  and  that  q  lies  to  the  right  of  the 
line  from  p  to  r.  Of  course  this  is  geometrically  impossible — but  the  floating 
point  arithmetic  doesn’t  know  that!  In  this  case  the  algorithm  will  accept  all 
three  edges.  Even  worse,  all  three  tests  could  give  the  opposite  answer,  in  which 
case  the  algorithm  rejects  all  three  edges,  leading  to  a  gap  in  the  boundary  of 
the  convex  hull.  And  this  leads  to  a  serious  problem  when  we  try  to  construct 
the  sorted  list  of  convex  hull  vertices  in  the  last  step  of  our  algorithm.  This  step 
assumes  that  there  is  exactly  one  edge  starting  in  every  convex  hull  vertex,  and 
exactly  one  edge  ending  there.  Due  to  the  rounding  errors  there  can  suddenly  be 
two,  or  no,  edges  starting  in  vertex  p.  This  can  cause  the  program  implementing 
our  simple  algorithm  to  crash,  since  the  last  step  has  not  been  designed  to  deal 
with  such  inconsistent  data. 

Although  we  have  proven  the  algorithm  to  be  correct  and  to  handle  all 
special  cases,  it  is  not  robust:  small  errors  in  the  computations  can  make  it 
fail  in  completely  unexpected  ways.  The  problem  is  that  we  have  proven  the 
correctness  assuming  that  we  can  compute  exactly  with  real  numbers. 


We  have  designed  our  first  geometric  algorithm.  It  computes  the  convex  hull 
of  a  set  of  points  in  the  plane.  However,  it  is  quite  slow — its  running  time  is 
0(n3) — ,  it  deals  with  degenerate  cases  in  an  awkward  way,  and  it  is  not  robust. 
We  should  try  to  do  better. 
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upper  hull 


lower  hull 


points  deleted 


To  this  end  we  apply  a  standard  algorithmic  design  technique:  we  will 
develop  an  incremental  algorithm.  This  means  that  we  will  add  the  points  in  P 
one  by  one,  updating  our  solution  after  each  addition.  We  give  this  incremental 
approach  a  geometric  flavor  by  adding  the  points  from  left  to  right.  So  we  first 
sort  the  points  by  x-coordinate,  obtaining  a  sorted  sequence  p\,...,pn,  and  then 
we  add  them  in  that  order.  Because  we  are  working  from  left  to  right,  it  would 
be  convenient  if  the  convex  hull  vertices  were  also  ordered  from  left  to  right 
as  they  occur  along  the  boundary.  But  this  is  not  the  case.  Therefore  we  first 
compute  only  those  convex  hull  vertices  that  lie  on  the  upper  hull,  which  is  the 
part  of  the  convex  hull  running  from  the  leftmost  point  p  \  to  the  rightmost  point 
p„  when  the  vertices  are  listed  in  clockwise  order.  In  other  words,  the  upper 
hull  contains  the  convex  hull  edges  bounding  the  convex  hull  from  above.  In  a 
second  scan,  which  is  performed  from  right  to  left,  we  compute  the  remaining 
part  of  the  convex  hull,  the  lower  hull. 

The  basic  step  in  the  incremental  algorithm  is  the  update  of  the  upper  hull 
after  adding  a  point  pt.  In  other  words,  given  the  upper  hull  of  the  points 
pi,... ,  Pi-i,  we  have  to  compute  the  upper  hull  of  pi,..  -,pi-  This  can  be  done 
as  follows.  When  we  walk  around  the  boundary  of  a  polygon  in  clockwise  order, 
we  make  a  turn  at  every  vertex.  For  an  arbitrary  polygon  this  can  be  both  a 
right  turn  and  a  left  turn,  but  for  a  convex  polygon  every  turn  must  be  a  right 
turn.  This  suggests  handling  the  addition  of  pi  in  the  following  way.  Let  Cupper 
be  a  list  that  stores  the  upper  vertices  in  left-to-right  order.  We  first  append  p, 
to  -Cupper-  This  is  correct  because  /?,-  is  the  rightmost  point  of  the  ones  added  so 
far,  so  it  must  be  on  the  upper  hull.  Next,  we  check  whether  the  last  three  points 
in  Cupper  make  a  right  turn.  If  this  is  the  case  there  is  nothing  more  to  do;  Cupper 
contains  the  vertices  of  the  upper  hull  of  p\,...,pj,  and  we  can  proceed  to  the 
next  point,  pi+\.  But  if  the  last  three  points  make  a  left  turn,  we  have  to  delete 
the  middle  one  from  the  upper  hull.  In  this  case  we  are  not  finished  yet:  it  could 
be  that  the  new  last  three  points  still  do  not  make  a  right  turn,  in  which  case  we 
again  have  to  delete  the  middle  one.  We  continue  in  this  manner  until  the  last 
three  points  make  a  right  turn,  or  until  there  are  only  two  points  left. 

We  now  give  the  algorithm  in  pseudocode.  The  pseudocode  computes  both  the 
upper  hull  and  the  lower  hull.  The  latter  is  done  by  treating  the  points  from  right 
to  left,  analogous  to  the  computation  of  the  upper  hull. 

Algorithm  ConvexHullCP) 

Input.  A  set  P  of  points  in  the  plane. 

Output.  A  list  containing  the  vertices  of  tl'K(P)  in  clockwise  order. 

1.  Sort  the  points  by  x-coordinate,  resulting  in  a  sequence  p\,...,pn. 

2.  Put  the  points  p\  and  pi  in  a  list  CUppCr,  with  p\  as  the  first  point. 

3.  for  i  <—  3  to  n 

4.  do  Append  p ,■  to  Cupper. 

5.  while  Cupper  contains  more  than  two  points  and  the  last  three  points 

in  Cupper  do  not  make  a  right  turn 

6.  do  Delete  the  middle  of  the  last  three  points  from  CUpper. 

7.  Put  the  points  pn  and  p„-\  in  a  list  Ciower,  with  pn  as  the  first  point. 


8.  for  i  <—  n  —  2  downto  1 

9.  do  Append  /?,  to  -C i0Wei  - 

10.  while  -C lower  contains  more  than  2  points  and  the  last  three  points 

in  Hiower  d°  n°t  make  a  right  turn 

11.  do  Delete  the  middle  of  the  last  three  points  from  £|ower. 

12.  Remove  the  first  and  the  last  point  from  £]ower  to  avoid  duplication  of  the 
points  where  the  upper  and  lower  hull  meet. 

13.  Append  £iOWer  to  £Upper.  and  call  the  resulting  list  L. 

14.  return  L 

Once  again,  when  we  look  closer  we  realize  that  the  above  algorithm  is  not 
correct.  Without  mentioning  it,  we  made  the  assumption  that  no  two  points  have 
the  same  x-coordinate.  If  this  assumption  is  not  valid  the  order  on  x-coordinate 
is  not  well  defined.  Fortunately,  this  turns  out  not  to  be  a  serious  problem. 
We  only  have  to  generalize  the  ordering  in  a  suitable  way:  rather  than  using 
only  the  x-coordinate  of  the  points  to  define  the  order,  we  use  the  lexicographic 
order.  This  means  that  we  first  sort  by  x-coordinate,  and  if  points  have  the  same 
x-coordinate  we  sort  them  by  y-coordinate. 

Another  special  case  we  have  ignored  is  that  the  three  points  for  which  we 
have  to  determine  whether  they  make  a  left  or  a  right  turn  lie  on  a  straight  line. 
In  this  case  the  middle  point  should  not  occur  on  the  convex  hull,  so  collinear 
points  must  be  treated  as  if  they  make  a  left  turn.  In  other  words,  we  should  use 
a  test  that  returns  true  if  the  three  points  make  a  right  turn,  and  false  otherwise. 
(Note  that  this  is  simpler  than  the  test  required  in  the  previous  algorithm  when 
there  were  collinear  points.) 

With  these  modifications  the  algorithm  correctly  computes  the  convex  hull: 
the  first  scan  computes  the  upper  hull,  which  is  now  defined  as  the  part  of  the 
convex  hull  running  from  the  lexicographically  smallest  vertex  to  the  lexico¬ 
graphically  largest  vertex,  and  the  second  scan  computes  the  remaining  part  of 
the  convex  hull. 

What  does  our  algorithm  do  in  the  presence  of  rounding  errors  in  the  floating 
point  arithmetic?  When  such  errors  occur,  it  can  happen  that  a  point  is  removed 
from  the  convex  hull  although  it  should  be  there,  or  that  a  point  inside  the  real 
convex  hull  is  not  removed.  But  the  structural  integrity  of  the  algorithm  is 
unharmed:  it  will  compute  a  closed  polygonal  chain.  After  all.  the  output  is 
a  list  of  points  that  we  can  interpret  as  the  clockwise  listing  of  the  vertices  of 
a  polygon,  and  any  three  consecutive  points  form  a  right  turn  or,  because  of 
the  rounding  errors,  they  almost  form  a  right  turn.  Moreover,  no  point  in  P 
can  be  far  outside  the  computed  hull.  The  only  problem  that  can  still  occur  is 
that,  when  three  points  lie  very  close  together,  a  turn  that  is  actually  a  sharp 
left  turn  can  be  interpretated  as  a  right  turn.  This  might  result  in  a  dent  in  the 
resulting  polygon.  A  way  out  of  this  is  to  make  sure  that  points  in  the  input 
that  are  very  close  together  are  considered  as  being  the  same  point,  for  example 
by  rounding.  Hence,  although  the  result  need  not  be  exactly  correct — but  then, 
we  cannot  hope  for  an  exact  result  if  we  use  inexact  arithmetic — it  does  make 
sense.  For  many  applications  this  is  good  enough.  Still,  it  is  wise  to  be  careful 
in  the  implementation  of  the  basic  test  to  avoid  errors  as  much  as  possible. 
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We  conclude  with  the  following  theorem: 

Theorem  1.1  The  convex  hull  of  a  set  ofn  points  in  the  plane  can  be  computed 
in  O(nlogn)  time. 

Proof.  We  will  prove  the  correctness  of  the  computation  of  the  upper  hull;  the 
lower  hull  computation  can  be  proved  correct  using  similar  arguments.  The 
proof  is  by  induction  on  the  number  of  point  treated.  Before  the  for-loop  starts, 
the  list  £ Upper  contains  the  points  p\  and  p2,  which  trivially  form  the  upper 
hull  of  {pi.pi}.  Now  suppose  that  £Upper  contains  the  upper  hull  vertices 
of  {pi, . . .  ,Pi-i}  and  consider  the  addition  of  p,.  After  the  execution  of  the 
while-loop  and  because  of  the  induction  hypothesis,  we  know  that  the  points  in 
£  upper  form  a  chain  that  only  makes  right  turns.  Moreover,  the  chain  starts  at  the 
lexicographically  smallest  point  of  {p\ , . . . ,/?,}  and  ends  at  the  lexicographically 
largest  point,  namely  pt.  If  we  can  show  that  all  points  of  {pi, . . that  are 
not  in  £  upper  are  below  the  chain,  then  £Upper  contains  the  correct  points.  By 
induction  we  know  there  is  no  point  above  the  chain  that  we  had  before  /;,  was 
added.  Since  the  old  chain  lies  below  the  new  chain,  the  only  possibility  for  a 
point  to  lie  above  the  new  chain  is  if  it  lies  in  the  vertical  slab  between  p,_i  and 
Pi.  But  this  is  not  possible,  since  such  a  point  would  be  in  between  p,-_  \  and  p , 
in  the  lexicographical  order.  (You  should  verify  that  a  similar  argument  holds  if 
Pi- 1  and  pi.  or  any  other  points,  have  the  same  v-coordinate.) 

To  prove  the  time  bound,  we  note  that  sorting  the  points  lexicographically 
can  be  done  in  0{n logn)  time.  Now  consider  the  computation  of  the  upper  hull. 
The  for-loop  is  executed  a  linear  number  of  times.  The  question  that  remains 
is  how  often  the  while-loop  inside  it  is  executed.  For  each  execution  of  the 
for-loop  the  while-loop  is  executed  at  least  once.  For  any  extra  execution  a 
point  is  deleted  from  the  current  hull.  As  each  point  can  be  deleted  only  once 
during  the  construction  of  the  upper  hull,  the  total  number  of  extra  executions 
over  all  for-loops  is  bounded  by  n.  Similarly,  the  computation  of  the  lower  hull 
takes  0(n)  time.  Due  to  the  sorting  step,  the  total  time  required  for  computing 
the  convex  hull  is  <9(nlogn).  0 

The  final  convex  hull  algorithm  is  simple  to  describe  and  easy  to  implement. 
It  only  requires  lexicographic  sorting  and  a  test  whether  three  consecutive  points 
make  a  right  turn.  From  the  original  definition  of  the  problem  it  was  far  from 
obvious  that  such  an  easy  and  efficient  solution  would  exist. 


1.2  Degeneracies  and  Robustness 

As  we  have  seen  in  the  previous  section,  the  development  of  a  geometric 
algorithm  often  goes  through  three  phases. 

In  the  first  phase,  we  try  to  ignore  everything  that  will  clutter  our  understanding 
of  the  geometric  concepts  we  are  dealing  with.  Sometimes  collinear  points  are 
a  nuisance,  sometimes  vertical  line  segments  are.  When  first  trying  to  design  or 
understand  an  algorithm,  it  is  often  helpful  to  ignore  these  degenerate  cases. 


In  the  second  phase,  we  have  to  adjust  the  algorithm  designed  in  the  first  phase  Section  1.2 

to  be  correct  in  the  presence  of  degenerate  cases.  Beginners  tend  to  do  this  degeneracies  and  robustness 

by  adding  a  huge  number  of  case  distinctions  to  their  algorithms.  In  many 
situations  there  is  a  better  way.  By  considering  the  geometry  of  the  problem 
again,  one  can  often  integrate  special  cases  with  the  general  case.  For  example, 
in  the  convex  hull  algorithm  we  only  had  to  use  the  lexicographical  order  instead 
of  the  order  on  x-coordinate  to  deal  with  points  with  equal  x-coordinate.  For 
most  algorithms  in  this  book  we  have  tried  to  take  this  integrated  approach  to 
deal  with  special  cases.  Still,  it  is  easier  not  to  think  about  such  cases  upon  first 
reading.  Only  after  understanding  how  the  algorithm  works  in  the  general  case 
should  you  think  about  degeneracies. 

If  you  study  the  computational  geometry  literature,  you  will  find  that  many 
authors  ignore  special  cases,  often  by  formulating  specific  assumptions  on  the 
input.  For  example,  in  the  convex  hull  problem  we  could  have  ignored  special 
cases  by  simply  stating  that  we  assume  that  the  input  is  such  that  no  three 
points  are  collinear  and  no  two  points  have  the  same  x-coordinate.  From  a 
theoretical  point  of  view,  such  assumptions  are  usually  justified:  the  goal  is 
then  to  establish  the  computational  complexity  of  a  problem  and,  although  it  is 
tedious  to  work  out  the  details,  degenerate  cases  can  almost  always  be  handled 
without  increasing  the  asymptotic  complexity  of  the  algorithm.  But  special  cases 
definitely  increase  the  complexity  of  the  implementations.  Most  researchers  in 
computational  geometry  today  are  aware  that  their  general  position  assumptions 
are  not  satisfied  in  practical  applications  and  that  an  integrated  treatment  of  the 
special  cases  is  normally  the  best  way  to  handle  them.  Furthermore,  there  are 
general  techniques — so-called  symbolic  perturbation  schemes — that  allow  one 
to  ignore  special  cases  during  the  design  and  implementation,  and  still  have  an 
algorithm  that  is  correct  in  the  presence  of  degeneracies. 

The  third  phase  is  the  actual  implementation.  Now  one  needs  to  think  about 
the  primitive  operations,  like  testing  whether  a  point  lies  to  the  left,  to  the  right, 
or  on  a  directed  line.  If  you  are  lucky  you  have  a  geometric  software  library 
available  that  contains  the  operations  you  need,  otherwise  you  must  implement 
them  yourself. 

Another  issue  that  arises  in  the  implementation  phase  is  that  the  assumption 
of  doing  exact  arithmetic  with  real  numbers  breaks  down,  and  it  is  necessary 
to  understand  the  consequences.  Robustness  problems  are  often  a  cause  of 
frustration  when  implementing  geometric  algorithms.  Solving  robustness  prob¬ 
lems  is  not  easy.  One  solution  is  to  use  a  package  providing  exact  arithmetic 
(using  integers,  rationals,  or  even  algebraic  numbers,  depending  on  the  type 
of  problem)  but  this  will  be  slow.  Alternatively,  one  can  adapt  the  algorithm 
to  detect  inconsistencies  and  take  appropriate  actions  to  avoid  crashing  the 
program.  In  this  case  it  is  not  guaranteed  that  the  algorithm  produces  the  correct 
output,  and  it  is  important  to  establish  the  exact  properties  that  the  output  has. 

This  is  what  we  did  in  the  previous  section,  when  we  developed  the  convex 
hull  algorithm:  the  result  might  not  be  a  convex  polygon  but  we  know  that  the 
structure  of  the  output  is  correct  and  that  the  output  polygon  is  very  close  to  the 
convex  hull.  Finally,  it  is  possible  to  predict,  based  on  the  input,  the  precision  in 
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the  number  representation  required  to  solve  the  problem  correctly. 

Which  approach  is  best  depends  on  the  application.  If  speed  is  not  an  issue, 
exact  arithmetic  is  preferred.  In  other  cases  it  is  not  so  important  that  the  result 
of  the  algorithm  is  precise.  For  example,  when  displaying  the  convex  hull  of  a 
set  of  points,  it  is  most  likely  not  noticeable  when  the  polygon  deviates  slightly 
from  the  true  convex  hull.  In  this  case  we  can  use  a  careful  implementation 
based  on  floating  point  arithmetic. 

In  the  rest  of  this  book  we  focus  on  the  design  phase  of  geometric  algorithms; 
we  won’t  say  much  about  the  problems  that  arise  in  the  implementation  phase. 


1.3  Application  Domains 

As  indicated  before,  we  have  chosen  a  motivating  example  application  for  every 
geometric  concept,  algorithm,  or  data  structure  introduced  in  this  book.  Most  of 
the  applications  stem  from  the  areas  of  computer  graphics,  robotics,  geographic 
information  systems,  and  CAD/CAM.  For  those  not  familiar  with  these  fields, 
we  give  a  brief  description  of  the  areas  and  indicate  some  of  the  geometric 
problems  that  arise  in  them. 

Computer  graphics.  Computer  graphics  is  concerned  with  creating  images 
of  modeled  scenes  for  display  on  a  computer  screen,  a  printer,  or  other  output 
device.  The  scenes  vary  from  simple  two-dimensional  drawings — consisting  of 
lines,  polygons,  and  other  primitive  objects — to  realistic-looking  3-dimensional 
scenes  including  light  sources,  textures,  and  so  on.  The  latter  type  of  scene  can 
easily  contain  over  a  million  polygons  or  curved  surface  patches. 

Because  scenes  consist  of  geometric  objects,  geometric  algorithms  play  an 
important  role  in  computer  graphics. 

For  2-dimensional  graphics,  typical  questions  involve  the  intersection  of 
certain  primitives,  determining  the  primitive  pointed  to  with  the  mouse,  or  deter¬ 
mining  the  subset  of  primitives  that  lie  within  a  particular  region.  Chapters  6,  10, 
and  16  describe  techniques  useful  for  some  of  these  problems. 

When  dealing  with  3-dimensional  problems  the  geometric  questions  be¬ 
come  more  complex.  A  crucial  step  in  displaying  a  3-dimensional  scene  is 
hidden  surface  removal:  determine  the  part  of  a  scene  visible  from  a  particular 
viewpoint  or,  in  other  words,  discard  the  parts  that  lie  behind  other  objects.  In 
Chapter  12  we  study  one  approach  to  this  problem. 

To  create  realistic-looking  scenes  we  have  to  take  light  into  account.  This 
creates  many  new  problems,  such  as  the  computation  of  shadows.  Hence, 
realistic  image  synthesis  requires  complicated  display  techniques,  like  ray 
tracing  and  radiosity.  When  dealing  with  moving  objects  and  in  virtual  reality 
applications,  it  is  important  to  detect  collisions  between  objects.  All  these 
situations  involve  geometric  problems. 
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Robotics.  The  field  of  robotics  studies  the  design  and  use  of  robots.  As  robots 
are  geometric  objects  that  operate  in  a  3-dimensional  space — the  real  world — it 


is  obvious  that  geometric  problems  arise  at  many  places.  At  the  beginning  of 
this  chapter  we  already  introduced  the  motion  planning  problem,  where  a  robot 
has  to  find  a  path  in  an  environment  with  obstacles.  In  Chapters  13  and  15  we 
study  some  simple  cases  of  motion  planning.  Motion  planning  is  one  aspect 
of  the  more  general  problem  of  task  planning.  One  would  like  to  give  a  robot 
high-level  tasks — “vacuum  the  room’’ — and  let  the  robot  figure  out  the  best  way 
to  execute  the  task.  This  involves  planning  motions,  planning  the  order  in  which 
to  perform  subtasks,  and  so  on. 

Other  geometric  problems  occur  in  the  design  of  robots  and  work  cells  in 
which  the  robot  has  to  operate.  Most  industrial  robots  are  robot  arms  with  a 
fixed  base.  The  parts  operated  on  by  the  robot  arm  have  to  be  supplied  in  such 
a  way  that  the  robot  can  easily  grasp  them.  Some  of  the  parts  may  have  to  be 
immobilized  so  that  the  robot  can  work  on  them.  They  may  also  have  to  be 
turned  to  a  known  orientation  before  the  robot  can  work  on  them.  These  are 
all  geometric  problems,  sometimes  with  a  kinematic  component.  Some  of  the 
algorithms  described  in  this  book  are  applicable  in  such  problems.  For  example, 
the  smallest  enclosing  disc  problem,  treated  in  Section  4.7,  can  be  used  for 
optimal  placement  of  robot  arms. 


Geographic  information  systems.  A  geographic  information  system,  or  GIS 
for  short,  stores  geographical  data  like  the  shape  of  countries,  the  height  of 
mountains,  the  course  of  rivers,  the  type  of  vegetation  at  different  locations, 
population  density,  or  rainfall.  They  can  also  store  human-made  structures 
such  as  cities,  roads,  railways,  electricity  lines,  or  gas  pipes.  A  GIS  can  be 
used  to  extract  information  about  certain  regions  and,  in  particular,  to  obtain 
information  about  the  relation  between  different  types  of  data.  For  example, 
a  biologist  may  want  to  relate  the  average  rainfall  to  the  existence  of  certain 
plants,  and  a  civil  engineer  may  need  to  query  a  GIS  to  determine  whether  there 
are  any  gas  pipes  underneath  a  lot  where  excavation  works  are  to  be  performed. 

As  most  geographic  information  concerns  properties  of  points  and  regions 
on  the  earth’s  surface,  geometric  problems  occur  in  abundance  here.  Moreover, 
the  amount  of  data  is  so  large  that  efficient  algorithms  are  a  must.  Below  we 
mention  the  GIS -related  problems  treated  in  this  book. 

A  first  question  is  how  to  store  geographic  data.  Suppose  that  we  want  to 
develop  a  car  guidance  system,  which  shows  the  driver  at  any  moment  where  she 
is.  This  requires  storing  a  huge  map  of  roads  and  other  data.  At  every  moment 
we  have  to  be  able  to  determine  the  position  of  the  car  on  the  map  and  to  quickly 
select  a  small  portion  of  the  map  for  display  on  the  on-board  computer.  Efficient 
data  structures  are  needed  for  these  operations.  Chapters  6,  10,  and  16  describe 
computational  geometry  solutions  to  these  problems. 

The  information  about  the  height  in  some  mountainous  terrain  is  usually 
only  available  at  certain  sample  points.  For  other  positions  we  have  to  obtain 
the  heights  by  interpolating  between  nearby  sample  points.  But  which  sample 
points  should  we  choose?  Chapter  9  deals  with  this  problem. 

The  combination  of  different  types  of  data  is  one  of  the  most  important 
operations  in  a  GIS.  For  example,  we  may  want  to  check  which  houses  lie  in 
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a  forest,  locate  all  bridges  by  checking  where  roads  cross  rivers,  or  determine 
a  good  location  for  a  new  golf  course  by  finding  a  slightly  hilly,  rather  cheap 
area  not  too  far  from  a  particular  town.  A  GIS  usually  stores  different  types  of 
data  in  separate  maps.  To  combine  the  data  we  have  to  overlay  different  maps. 
Chapter  2  deals  with  a  problem  arising  when  we  want  to  compute  the  overlay. 

Finally,  we  mention  the  same  example  we  gave  at  the  beginning  of  this 
chapter:  the  location  of  the  nearest  public  phone  (or  hospital,  or  any  other 
facility).  This  requires  the  computation  of  a  Voronoi  diagram,  a  structure 
studied  in  detail  in  Chapter  7. 

CAD/CAM.  Computer  aided  design  (CAD)  concerns  itself  with  the  design 
of  products  with  a  computer.  The  products  can  vary  from  printed  circuit  boards, 
machine  parts,  or  furniture,  to  complete  buildings.  In  all  cases  the  resulting 
product  is  a  geometric  entity  and,  hence,  it  is  to  be  expected  that  all  sorts  of 
geometric  problems  appear.  Indeed,  CAD  packages  have  to  deal  with  intersec¬ 
tions  and  unions  of  objects,  with  decomposing  objects  and  object  boundaries 
into  simpler  shapes,  and  with  visualizing  the  designed  products. 

To  decide  whether  a  design  meets  the  specifications  certain  tests  are  needed. 
Often  one  does  not  need  to  build  a  prototype  for  these  tests,  and  a  simulation 
suffices.  Chapter  14  deals  with  a  problem  arising  in  the  simulation  of  heat 
emission  by  a  printed  circuit  board. 

Once  an  object  has  been  designed  and  tested,  it  has  to  be  manufactured. 
Computer  aided  manufacturing  (CAM)  packages  can  be  of  assistance  here. 
CAM  involves  many  geometric  problems.  Chapter  4  studies  one  of  them. 

A  recent  trend  is  design  for  assembly ,  where  assembly  decisions  are  already 
taken  into  account  during  the  design  stage.  A  CAD  system  supporting  this 
would  allow  designers  to  test  their  design  for  feasibility,  answering  questions 
like:  can  the  product  be  built  easily  using  a  certain  manufacturing  process? 
Many  of  these  questions  require  geometric  algorithms  to  be  answered. 

Other  applications  domains.  There  are  many  more  application  domains 
where  geometric  problems  occur  and  geometric  algorithms  and  data  structures 
can  be  used  to  solve  them. 

For  example,  in  molecular  modeling,  molecules  are  often  represented  by 
collections  of  intersecting  balls  in  space,  one  ball  for  each  atom.  Typical 
questions  are  to  compute  the  union  of  the  atom  balls  to  obtain  the  molecule 
surface,  or  to  compute  where  two  molecules  can  touch  each  other. 

Another  area  is  pattern  recognition.  Consider  for  example  an  optical  char¬ 
acter  recognition  system.  Such  a  system  scans  a  paper  with  text  on  it  with  the 
goal  of  recognizing  the  text  characters.  A  basic  step  is  to  match  the  image  of  a 
character  against  a  collection  of  stored  characters  to  find  the  one  that  best  fits  it. 
This  leads  to  a  geometric  problem:  given  two  geometric  objects,  determine  how 
well  they  resemble  each  other. 

Even  certain  areas  that  at  first  sight  do  not  seem  to  be  geometric  can  ben¬ 
efit  from  geometric  algorithms,  because  it  is  often  possible  to  formulate  non¬ 
geometric  problem  in  geometric  terms.  In  Chapter  5,  for  instance,  we  will  see 


how  records  in  a  database  can  be  interpreted  as  points  in  a  higher-dimensional  Section  1.4 

space,  and  we  will  present  a  geometric  data  structure  such  that  certain  queries  notes  and  comments 

on  the  records  can  be  answered  efficiently. 

We  hope  that  the  above  collection  of  geometric  problems  makes  it  clear  that 
computational  geometry  plays  a  role  in  many  different  areas  of  computer  sci¬ 
ence.  The  algorithms,  data  structures,  and  techniques  described  in  this  book 
will  provide  you  with  the  tools  needed  to  attack  such  geometric  problems 
successfully. 


1.4  Notes  and  Comments 

Every  chapter  of  this  book  ends  with  a  section  entitled  Notes  and  Comments. 

These  sections  indicate  where  the  results  described  in  the  chapter  came  from, 
indicate  generalizations  and  improvements,  and  provide  references.  They  can 
be  skipped  but  do  contain  useful  material  for  those  who  want  to  know  more 
about  the  topic  of  the  chapter.  More  information  can  also  be  found  in  the 
Handbook  of  Computational  Geometry  [331]  and  the  Handbook  of  Discrete  and 
Computational  Geometry  [191], 

In  this  chapter  the  geometric  problem  treated  in  detail  was  the  computation 
of  the  convex  hull  of  a  set  of  points  in  the  plane.  This  is  a  classic  topic  in 
computational  geometry  and  the  amount  of  literature  about  it  is  huge.  The 
algorithm  described  in  this  chapter  is  commonly  known  as  Graham ’s  scan ,  and 
is  based  on  a  modification  by  Andrew  [17]  of  one  of  the  earliest  algorithms  by 
Graham  [192].  This  is  only  one  of  the  many  O(nlogn)  algorithms  available  for 
solving  the  problem.  A  divide-and-conquer  approach  was  given  by  Preparata 
and  Hong  [322].  Also  an  incremental  method  exists  that  inserts  the  points 
one  by  one  in  O(\ogn)  time  per  insertion  [321].  Overmars  and  van  Leeuwen 
generalized  this  to  a  method  in  which  points  could  be  both  inserted  and  deleted 
in  (9 (log 2  n)  time  [305].  Other  results  on  dynamic  convex  hulls  were  obtained 
by  Hershberger  and  Suri  [211],  Chan  [83],  and  Brodal  and  Jacob  [73]. 

Even  though  an  Q.(nlogn)  lower  bound  is  known  for  the  problem  [393] 
many  authors  have  tried  to  improve  the  result.  This  makes  sense  because  in  many 
applications  the  number  of  points  that  appear  on  the  convex  hull  is  relatively 
small,  while  the  lower  bound  result  assumes  that  (almost)  all  points  show  up 
on  the  convex  hull.  Hence,  it  is  useful  to  look  at  algorithms  whose  running 
time  depends  on  the  complexity  of  the  convex  hull.  Jarvis  [221]  introduced 
a  wrapping  technique,  often  referred  to  as  Jarvis ’s  march,  that  computes  the 
convex  hull  in  0(li  ■  n)  time  where  h  is  the  complexity  of  the  convex  hull.  The 
same  worst-case  performance  is  achieved  by  the  algorithm  of  Overmars  and 
van  Leeuwen  [303],  based  on  earlier  work  by  Bykat  [79],  Eddy  [156],  and 
Green  and  Silverman  [193].  This  algorithm  has  the  advantage  that  its  expected 
running  time  is  linear  for  many  distributions  of  points.  Finally,  Kirkpatrick 

and  Seidel  [238]  improved  the  result  to  O(n\ogh),  and  recently  Chan  [82]  _ 

discovered  a  much  simpler  algorithm  to  achieve  the  same  result.  13 
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The  convex  hull  can  be  defined  in  any  dimension.  Convex  hulls  in  3- 
dimensional  space  can  still  be  computed  in  O(nlogn)  time,  as  we  will  see  in 
Chapter  1 1 .  For  dimensions  higher  than  3,  however,  the  complexity  of  the 
convex  hull  is  no  longer  linear  in  the  number  of  points.  See  the  notes  and 
comments  of  Chapter  1 1  for  more  details. 

In  the  past  years  a  number  of  general  methods  for  handling  special  cases  have 
been  suggested.  These  symbolic  perturbation  schemes  perturb  the  input  in  such 
a  way  that  all  degeneracies  disappear.  However,  the  perturbation  is  only  done 
symbolically.  This  technique  was  introduced  by  Edelsbrunner  and  Miicke  [164] 
and  later  refined  by  Yap  [397]  and  Emiris  and  Canny  [172,  171].  Symbolic 
perturbation  relieves  the  programmer  of  the  burden  of  degeneracies,  but  it 
has  some  drawbacks:  the  use  of  a  symbolic  perturbation  library  slows  down 
the  algorithm,  and  sometimes  one  needs  to  recover  the  “real  result’’  from  the 
“perturbed  result”,  which  is  not  always  easy.  These  drawbacks  led  Burnikel  et 
al.  [78]  to  claim  that  it  is  both  simpler  (in  terms  of  programming  effort)  and 
more  efficient  (in  terms  of  running  time)  to  deal  directly  with  degenerate  inputs. 

Robustness  in  geometric  algorithms  is  a  topic  that  has  recently  received  a  lot  of 
interest.  Most  geometric  comparisons  can  be  formulated  as  computing  the  sign 
of  some  determinant.  A  possible  way  to  deal  with  the  inexactness  in  floating 
point  arithmetic  when  evaluating  this  sign  is  to  choose  a  small  threshold  value 
£  and  to  say  that  the  determinant  is  zero  when  the  outcome  of  the  floating 
point  computation  is  less  than  e.  When  implemented  naively,  this  can  lead  to 
inconsistencies  (for  instance,  for  three  points  a,b,c  we  may  decide  that  a  =  b 
and  b  =  c  but  a^c)  that  cause  the  program  to  fail.  Guibas  et  al.  [198]  showed 
that  combining  such  an  approach  with  interval  arithmetic  and  backwards  error 
analysis  can  give  robust  algorithms.  Another  option  is  to  use  exact  arithmetic. 
Here  one  computes  as  many  bits  of  the  determinant  as  are  needed  to  determine  its 
sign.  This  will  slow  down  the  computation,  but  techniques  have  been  developed 
to  keep  the  performance  penalty  relatively  small  [182,  256,  395].  Besides 
these  general  approaches,  there  have  been  a  number  papers  dealing  with  robust 
computation  in  specific  problems  [34,  37,  81,  145,  180,  181,  219,  279]. 

We  gave  a  brief  overview  of  the  application  domains  from  which  we  took  our 
examples,  which  serve  to  show  the  motivation  behind  the  various  geometric 
notions  and  algorithms  studied  in  this  book.  Below  are  some  references  to 
textbooks  you  can  consult  if  you  want  to  know  more  about  the  application 
domains.  Of  course  there  are  many  more  good  books  about  these  domains  than 
the  few  we  mention. 

There  is  a  large  number  of  books  on  computer  graphics.  The  book  by  Foley 
et  al.  [179]  is  very  extensive  and  generally  considered  one  of  the  best  books  on 
the  topic.  Other  good  books  are  the  ones  by  Shirley  et  al.  [359]  and  Watt  [381]. 

An  extensive  overview  of  robotics  and  the  motion  planning  problem  can  be 
found  in  the  book  of  Choset  et  al.  [127],  and  in  the  somewhat  older  books  of 
Latombe  [243]  and  Hopcroft,  Schwartz,  and  Sharir  [217].  More  information  on 
geometric  aspects  of  robotics  is  provided  by  the  book  of  Selig  [348]. 
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There  is  a  large  collection  of  books  about  geographic  information  systems.  Section  1.5 
but  most  of  them  do  not  consider  algorithmic  issues  in  much  detail.  Some  exercises 
general  textbooks  are  the  ones  by  DeMers  [140],  Longley  et  al.  [257],  and 
Worboys  and  Duckham  [392].  Data  structures  for  spatial  data  are  described 
extensively  in  the  book  of  Samet  [335], 

The  books  by  Faux  and  Pratt  [175],  Mortenson  [285],  and  Hoffmann  [216] 
are  good  introductory  texts  on  CAD/CAM  and  geometric  modeling. 


1.5  Exercises 


1.1  The  convex  hull  of  a  set  S  is  defined  to  be  the  intersection  of  all  convex 
sets  that  contain  S.  For  the  convex  hull  of  a  set  of  points  it  was  indicated 
that  the  convex  hull  is  the  convex  set  with  smallest  perimeter.  We  want  to 
show  that  these  are  equivalent  definitions. 

a.  Prove  that  the  intersection  of  two  convex  sets  is  again  convex.  This 
implies  that  the  intersection  of  a  finite  family  of  convex  sets  is  convex 
as  well. 

b.  Prove  that  the  smallest  perimeter  polygon  CP  containing  a  set  of  points 
P  is  convex. 

c.  Prove  that  any  convex  set  containing  the  set  of  points  P  contains  the 
smallest  perimeter  polygon  CP. 

1 .2  Let  P  be  a  set  of  points  in  the  plane.  Let  CP  be  the  convex  polygon  whose 
vertices  are  points  from  P  and  that  contains  all  points  in  P.  Prove  that  this 
polygon  CP  is  uniquely  defined,  and  that  it  is  the  intersection  of  all  convex 
sets  containing  P. 

1 .3  Let  E  be  an  unsorted  set  of  n  segments  that  are  the  edges  of  a  convex 
polygon.  Describe  an  O(nlogn)  algorithm  that  computes  from  E  a  list 
containing  all  vertices  of  the  polygon,  sorted  in  clockwise  order. 


1 .4  For  the  convex  hull  algorithm  we  have  to  be  able  to  test  whether  a  point 
r  lies  left  or  right  of  the  directed  line  through  two  points  p  and  q.  Let 
P  =  (Px,Py),  q  =  (qx,qy),  and  r  =  (rx, ry). 

a.  Show  that  the  sign  of  the  determinant 


D  = 


1  Px  Py 

i  qx  qy 

1  rx  I'y 


determines  whether  r  lies  left  or  right  of  the  line. 

b.  Show  that  |Z)|  in  fact  is  twice  the  surface  of  the  triangle  determined  by 
p,  q,  and  r. 

c.  Why  is  this  an  attractive  way  to  implement  the  basic  test  in  algorithm 
ConvexHull?  Give  an  argument  for  both  integer  and  floating  point 
coordinates. 
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Chapter  1  1-5  Verify  that  the  algorithm  CONVEXHull  with  the  indicated  modifications 

computational  geometry  correctly  computes  the  convex  hull,  also  of  degenerate  sets  of  points. 

Consider  for  example  such  nasty  cases  as  a  set  of  points  that  all  lie  on  one 
(vertical)  line. 

1.6  In  many  situations  we  need  to  compute  convex  hulls  of  objects  other  than 
points. 

a.  Let  S  be  a  set  of  n  line  segments  in  the  plane.  Prove  that  the  convex 
hull  of  S  is  exactly  the  same  as  the  convex  hull  of  the  2 n  endpoints  of 
the  segments. 

b. *  Let  (P  be  a  non-convex  polygon.  Describe  an  algorithm  that  computes 
the  convex  hull  of  CP  in  0(n)  time.  Hint:  Use  a  variant  of  algorithm 
ConvexHull  where  the  vertices  are  not  treated  in  lexicographical 
order,  but  in  some  other  order. 

1 .7  Consider  the  following  alternative  approach  to  computing  the  convex  hull 
of  a  set  of  points  in  the  plane:  We  start  with  the  rightmost  point.  This  is 
the  first  point  p\  of  the  convex  hull.  Now  imagine  that  we  start  with  a 
vertical  line  and  rotate  it  clockwise  until  it  hits  another  point  pi.  This  is 
the  second  point  on  the  convex  hull.  We  continue  rotating  the  line  but  this 
time  around  pi  until  we  hit  a  point  pi .  In  this  way  we  continue  until  we 
reach  p\  again. 

a.  Give  pseudocode  for  this  algorithm. 

b.  What  degenerate  cases  can  occur  and  how  can  we  deal  with  them? 

c.  Prove  that  the  algorithm  correctly  computes  the  convex  hull. 

d.  Prove  that  the  algorithm  can  be  implemented  to  run  in  time  0(n  ■  h), 
where  h  is  the  complexity  of  the  convex  hull. 

e.  What  problems  might  occur  when  we  deal  with  inexact  floating  point 
arithmetic? 

1.8  The  O(nlogn)  algorithm  to  compute  the  convex  hull  of  a  set  of  n  points 
in  the  plane  that  was  described  in  this  chapter  is  based  on  the  paradigm 
of  incremental  construction:  add  the  points  one  by  one,  and  update  the 
convex  hull  after  each  addition.  In  this  exercise  we  shall  develop  an 
algorithm  based  on  another  paradigm,  namely  divide-and-conquer. 

a.  Let  (P]  and  TS  be  two  disjoint  convex  polygons  with  n  vertices  in  total. 
Give  an  0(n)  time  algorithm  that  computes  the  convex  hull  of  Ti  U  ‘J>2- 

b.  Use  the  algorithm  from  part  a  to  develop  an  O(n\ogn)  time  divide-and- 
conquer  algorithm  to  compute  the  convex  hull  of  a  set  of  n  points  in 
the  plane. 

1.9  Suppose  that  we  have  a  subroutine  ConvexHull  available  for  comput¬ 
ing  the  convex  hull  of  a  set  of  points  in  the  plane.  Its  output  is  a  list  of  con¬ 
vex  hull  vertices,  sorted  in  clockwise  order.  Now  let  S  =  {x\  ,xi,  ■  . .  ,x„} 
be  a  set  of  n  numbers.  Show  that  S  can  be  sorted  in  0(n)  time  plus  the 
time  needed  for  one  call  to  ConvexHull.  Since  the  sorting  problem 
has  an  Cl(n\ogn)  lower  bound,  this  implies  that  the  convex  hull  problem 
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has  an  £2(n  log  n)  lower  bound  as  well.  Hence,  the  algorithm  presented  in 
this  chapter  is  asymptotically  optimal. 


Section  1.5 

EXERCISES 


1.10  Let  S  be  a  set  of  n  (possibly  intersecting)  unit  circles  in  the  plane.  We 
want  to  compute  the  convex  hull  of  S. 

a.  Show  that  the  boundary  of  the  convex  hull  of  S  consists  of  straight 
line  segments  and  pieces  of  circles  in  S. 

b.  Show  that  each  circle  can  occur  at  most  once  on  the  boundary  of  the 
convex  hull. 

c.  Let  S'  be  the  set  of  points  that  are  the  centers  of  the  circles  in  S.  Show 
that  a  circle  in  S  appears  on  the  boundary  of  the  convex  hull  if  and 
only  if  the  center  of  the  circle  lies  on  the  convex  hull  of  S'. 

d.  Give  an  O(nlogn)  algorithm  for  computing  the  convex  hull  of  S. 

e. *  Give  an  0(n  log/;)  algorithm  for  the  case  in  which  the  circles  in  S 

have  different  radii. 
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2  Line  Segment  Intersection 

Thematic  Map  Overlay 


When  you  are  visiting  a  country,  maps  are  an  invaluable  source  of  information. 
They  tell  you  where  tourist  attractions  are  located,  they  indicate  the  roads  and 
railway  lines  to  get  there,  they  show  small  lakes,  and  so  on.  Unfortunately, 
they  can  also  be  a  source  of  frustration,  as  it  is  often  difficult  to  find  the  right 
information:  even  when  you  know  the  approximate  position  of  a  small  town, 
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Figure  2. 1 

Cities,  rivers,  railroads,  and  their 
overlay  in  western  Canada 


it  can  still  be  difficult  to  spot  it  on  the  map.  To  make  maps  more  readable, 
geographic  information  systems  split  them  into  several  layers.  Each  layer  is  a 
thematic  map,  that  is,  it  stores  only  one  type  of  information.  Thus  there  will 
be  a  layer  storing  the  roads,  a  layer  storing  the  cities,  a  layer  storing  the  rivers. 
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and  so  on.  The  theme  of  a  layer  can  also  be  more  abstract.  For  instance,  there 
could  be  a  layer  for  the  population  density,  for  average  precipitation,  habitat  of 
the  grizzly  bear,  or  for  vegetation.  The  type  of  geometric  information  stored 
in  a  layer  can  be  very  different:  the  layer  for  a  road  map  could  store  the  roads 
as  collections  of  line  segments  (or  curves,  perhaps),  the  layer  for  cities  could 
contain  points  labeled  with  city  names,  and  the  layer  for  vegetation  could  store 
a  subdivision  of  the  map  into  regions  labeled  with  the  type  of  vegetation. 

Users  of  a  geographic  information  system  can  select  one  of  the  thematic 
maps  for  display.  To  find  a  small  town  you  would  select  the  layer  storing  cities, 
and  you  would  not  be  distracted  by  information  such  as  the  names  of  rivers 
and  lakes.  After  you  have  spotted  the  town,  you  probably  want  to  know  how  to 
get  there.  To  this  end  geographic  information  systems  allow  users  to  view  an 
overlay  of  several  maps — see  Figure  2.1.  Using  an  overlay  of  the  road  map  and 
the  map  storing  cities  you  can  now  figure  out  how  to  get  to  the  town.  When  two 
or  more  thematic  map  layers  are  shown  together,  intersections  in  the  overlay 
are  positions  of  special  interest.  For  example,  when  viewing  the  overlay  of 
the  layer  for  the  roads  and  the  layer  for  the  rivers,  it  would  be  useful  if  the 
intersections  were  clearly  marked.  In  this  example  the  two  maps  are  basically 
networks,  and  the  intersections  are  points.  In  other  cases  one  is  interested  in 
the  intersection  of  complete  regions.  For  instance,  geographers  studying  the 
climate  could  be  interested  in  finding  regions  where  there  is  pine  forest  and  the 
annual  precipitation  is  between  1000  mm  and  1500  mm.  These  regions  are  the 
intersections  of  the  regions  labeled  “pine  forest”  in  the  vegetation  map  and  the 
regions  labeled  “1000-1500”  in  the  precipitation  map. 


2.1  Line  Segment  Intersection 

We  first  study  the  simplest  form  of  the  map  overlay  problem,  where  the  two  map 
layers  are  networks  represented  as  collections  of  line  segments.  For  example,  a 
map  layer  storing  roads,  railroads,  or  rivers  at  a  small  scale.  Note  that  curves  can 
be  approximated  by  a  number  of  small  segments.  At  first  we  won’t  be  interested 
in  the  regions  induced  by  these  line  segments.  Later  we  shall  look  at  the  more 
complex  situation  where  the  maps  are  not  just  networks,  but  subdivisions  of 
the  plane  into  regions  that  have  an  explicit  meaning.  To  solve  the  network 
overlay  problem  we  first  have  to  state  it  in  a  geometric  setting.  For  the  overlay 
of  two  networks  the  geometric  situation  is  the  following:  given  two  sets  of 
line  segments,  compute  all  intersections  between  a  segment  from  one  set  and  a 
segment  from  the  other.  This  problem  specification  is  not  quite  precise  enough 
yet,  as  we  didn’t  define  when  two  segments  intersect.  In  particular,  do  two 
segments  intersect  when  an  endpoint  of  one  of  them  lies  on  the  other?  In  other 
words,  we  have  to  specify  whether  the  input  segments  are  open  or  closed.  To 
make  this  decision  we  should  go  back  to  the  application,  the  network  overlay 
problem.  Roads  in  a  road  map  and  rivers  in  a  river  map  are  represented  by 
chains  of  segments,  so  a  crossing  of  a  road  and  a  river  corresponds  to  the  interior 
of  one  chain  intersecting  the  interior  of  another  chain.  This  does  not  mean  that 


there  is  an  intersection  between  the  interior  of  two  segments:  the  intersection 
point  could  happen  to  coincide  with  an  endpoint  of  a  segment  of  a  chain.  In 
fact,  this  situation  is  not  uncommon  because  windy  rivers  are  represented  by 
many  small  segments  and  coordinates  of  endpoints  may  have  been  rounded 
when  maps  are  digitized.  We  conclude  that  we  should  define  the  segments  to  be 
closed,  so  that  an  endpoint  of  one  segment  lying  on  another  segment  counts  as 
an  intersection. 

To  simplify  the  description  somewhat  we  shall  put  the  segments  from  the  two 
sets  into  one  set,  and  compute  all  intersections  among  the  segments  in  that  set. 
This  way  we  certainly  find  all  the  intersections  we  want.  We  may  also  find 
intersections  between  segments  from  the  same  set.  Actually,  we  certainly  will, 
because  in  our  application  the  segments  from  one  set  form  a  number  of  chains, 
and  we  count  coinciding  endpoints  as  intersections.  These  other  intersections 
can  be  filtered  out  afterwards  by  simply  checking  for  each  reported  intersection 
whether  the  two  segments  involved  belong  to  the  same  set.  So  our  problem 
specification  is  as  follows:  given  a  set  S  of  n  closed  segments  in  the  plane,  report 
all  intersection  points  among  the  segments  in  S. 

This  doesn’t  seem  like  a  challenging  problem:  we  can  simply  take  each  pair 
of  segments,  compute  whether  they  intersect,  and,  if  so,  report  their  intersection 
point.  This  brute-force  algorithm  clearly  requires  0{n2)  time.  In  a  sense  this  is 
optimal:  when  each  pair  of  segments  intersects  any  algorithm  must  take  £2(n2) 
time,  because  it  has  to  report  all  intersections.  A  similar  example  can  be  given 
when  the  overlay  of  two  networks  is  considered.  In  practical  situations,  however, 
most  segments  intersect  no  or  only  a  few  other  segments,  so  the  total  number  of 
intersection  points  is  much  smaller  than  quadratic.  It  would  be  nice  to  have  an 
algorithm  that  is  faster  in  such  situations.  In  other  words,  we  want  an  algorithm 
whose  running  time  depends  not  only  on  the  number  of  segments  in  the  input, 
but  also  on  the  number  of  intersection  points.  Such  an  algorithm  is  called  an 
output-sensitive  algorithm:  the  running  time  of  the  algorithm  is  sensitive  to  the 
size  of  the  output.  We  could  also  call  such  an  algorithm  intersection-sensitive , 
since  the  number  of  intersections  is  what  determines  the  size  of  the  output. 

How  can  we  avoid  testing  all  pairs  of  segments  for  intersection?  Here  we 
must  make  use  of  the  geometry  of  the  situation:  segments  that  are  close  together 
are  candidates  for  intersection,  unlike  segments  that  are  far  apart.  Below  we 
shall  see  how  we  can  use  this  observation  to  obtain  an  output-sensitive  algorithm 
for  the  line  segment  intersection  problem. 

Let  S  :  =  {si , 52 ,  •  •  • , sn }  be  the  set  of  segments  for  which  we  want  to  compute 
all  intersections.  We  want  to  avoid  testing  pairs  of  segments  that  are  far  apart. 
But  how  can  we  do  this?  Let’s  first  try  to  rule  out  an  easy  case.  Define  the 
y-interval  of  a  segment  to  be  its  orthogonal  projection  onto  the  y-axis.  When  the 
y-intervals  of  a  pair  of  segments  do  not  overlap — we  could  say  that  they  are  far 
apart  in  the  y-direction — then  they  cannot  intersect.  Hence,  we  only  need  to  test 
pairs  of  segments  whose  y-intervals  overlap,  that  is,  pairs  for  which  there  exists 
a  horizontal  line  that  intersects  both  segments.  To  find  these  pairs  we  imagine 
sweeping  a  line  l  downwards  over  the  plane,  starting  from  a  position  above  all 
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Chapter  2  segments.  While  we  sweep  the  imaginary  line,  we  keep  track  of  all  segments 
line  segment  intersection  intersecting  it — the  details  of  this  will  be  explained  later — so  that  we  can  find 

the  pairs  we  need. 


event  point 


This  type  of  algorithm  is  called  a  plane  sweep  algorithm  and  the  line  t  is  called 
the  sweep  line.  The  status  of  the  sweep  line  is  the  set  of  segments  intersecting  it. 
The  status  changes  while  the  sweep  line  moves  downwards,  but  not  continuously. 
Only  at  particular  points  is  an  update  of  the  status  required.  We  call  these  points 
the  event  points  of  the  plane  sweep  algorithm.  In  this  algorithm  the  event  points 
are  the  endpoints  of  the  segments. 

The  moments  at  which  the  sweep  line  reaches  an  event  point  are  the  only 
moments  when  the  algorithm  actually  does  something:  it  updates  the  status  of 
the  sweep  line  and  performs  some  intersection  tests.  In  particular,  if  the  event 
point  is  the  upper  endpoint  of  a  segment,  then  a  new  segment  starts  intersecting 
the  sweep  line  and  must  be  added  to  the  status.  This  segment  is  tested  for 
intersection  against  the  ones  already  intersecting  the  sweep  line.  If  the  event 
point  is  a  lower  endpoint,  a  segment  stops  intersecting  the  sweep  line  and  must 
be  deleted  from  the  status.  This  way  we  only  test  pairs  of  segments  for  which 
there  is  a  horizontal  line  that  intersects  both  segments.  Unfortunately,  this  is 
not  enough:  there  are  still  situations  where  we  test  a  quadratic  number  of  pairs, 
whereas  there  is  only  a  small  number  of  intersection  points.  A  simple  example 
is  a  set  of  vertical  segments  that  all  intersect  the  x-axis.  So  the  algorithm  is  not 
output-sensitive.  The  problem  is  that  two  segments  that  intersect  the  sweep  line 
can  still  be  far  apart  in  the  horizontal  direction. 

Let’s  order  the  segments  from  left  to  right  as  they  intersect  the  sweep  line, 
to  include  the  idea  of  being  close  in  the  horizontal  direction.  We  shall  only 
test  segments  when  they  are  adjacent  in  the  horizontal  ordering.  This  means 
that  we  only  test  any  new  segment  against  two  segments,  namely,  the  ones 
immediately  left  and  right  of  the  upper  endpoint.  Later,  when  the  sweep  line  has 
moved  downwards  to  another  position,  a  segment  can  become  adjacent  to  other 
segments  against  which  it  will  be  tested.  Our  new  strategy  should  be  reflected  in 
the  status  of  our  algorithm:  the  status  now  corresponds  to  the  ordered  sequence 
of  segments  intersecting  the  sweep  line.  The  new  status  not  only  changes  at 
endpoints  of  segments;  it  also  changes  at  intersection  points,  where  the  order 
of  the  intersected  segments  changes.  When  this  happens  we  must  test  the  two 
segments  that  change  position  against  their  new  neighbors.  This  is  a  new  type 
of  event  point. 

Before  trying  to  turn  these  ideas  into  an  efficient  algorithm,  we  should 
convince  ourselves  that  the  approach  is  correct.  We  have  reduced  the  number 
of  pairs  to  be  tested,  but  do  we  still  find  all  intersections?  In  other  words,  if 
two  segments  s,  and  Sj  intersect,  is  there  always  a  position  of  the  sweep  line  £ 
where  s/  and  s  ;  are  adjacent  along  t!  Let’s  first  ignore  some  nasty  cases:  assume 
that  no  segment  is  horizontal,  that  any  two  segments  intersect  in  at  most  one 
point — they  do  not  overlap — ,  and  that  no  three  segments  meet  in  a  common 
point.  Later  we  shall  see  that  these  cases  are  easy  to  handle,  but  for  now  it 
is  convenient  to  forget  about  them.  The  intersections  where  an  endpoint  of  a 
segment  lies  on  another  segment  can  easily  be  detected  when  the  sweep  line 
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reaches  the  endpoint.  So  the  only  question  is  whether  intersections  between  the 
interiors  of  segments  are  always  detected. 

Lemma  2.1  Let  Si  and  s  ;  be  two  non-horizontal  segments  whose  interiors 
intersect  in  a  single  point  p,  and  assume  there  is  no  third  segment  passing 
through  p.  Then  there  is  an  event  point  above  p  where  Sj  and  sj  become 
adjacent  and  are  tested  for  intersection. 

Proof.  Let  £  be  a  horizontal  line  slightly  above  p.  If  £  is  close  enough  to  p  then 
Si  and  Sj  must  be  adjacent  along  i.  (To  be  precise,  we  should  take  t  such  that 
there  is  no  event  point  on  l,  nor  in  between  £  and  the  horizontal  line  through 
p.)  In  other  words,  there  is  a  position  of  the  sweep  line  where  ,y,  and  ,v;  are 
adjacent.  On  the  other  hand.  Sj  and  ,v/  are  not  yet  adjacent  when  the  algorithm 
starts,  because  the  sweep  line  starts  above  all  line  segments  and  the  status  is 
empty.  Hence,  there  must  be  an  event  point  q  where  ,v,  and  Sj  become  adjacent 
and  are  tested  for  intersection.  0 

So  our  approach  is  correct,  at  least  when  we  forget  about  the  nasty  cases 
mentioned  earlier.  Now  we  can  proceed  with  the  development  of  the  plane 
sweep  algorithm.  Let’s  briefly  recap  the  overall  approach.  We  imagine  moving 
a  horizontal  sweep  line  £  downwards  over  the  plane.  The  sweep  line  halts  at 
certain  event  points;  in  our  case  these  are  the  endpoints  of  the  segments,  which 
we  know  beforehand,  and  the  intersection  points,  which  are  computed  on  the 
fly.  While  the  sweep  line  moves  we  maintain  the  ordered  sequence  of  segments 
intersected  by  it.  When  the  sweep  line  halts  at  an  event  point  the  sequence  of 
segments  changes  and,  depending  on  the  type  of  event  point,  we  have  to  take 
several  actions  to  update  the  status  and  detect  intersections. 

When  the  event  point  is  the  upper  endpoint  of  a  segment,  there  is  a  new  segment 
intersecting  the  sweep  line.  This  segment  must  be  tested  for  intersection  against 
its  two  neighbors  along  the  sweep  line.  Only  intersection  points  below  the 
sweep  line  are  important;  the  ones  above  the  sweep  line  have  been  detected 
already.  For  example,  if  segments  ,v,  and  Sk  are  adjacent  on  the  sweep  line,  and 
a  new  upper  endpoint  of  a  segment  sj  appears  in  between,  then  we  have  to  test 
sj  for  intersection  with  ,v,  and  Sk-  If  we  find  an  intersection  below  the  sweep 
line,  we  have  found  a  new  event  point.  After  the  upper  endpoint  is  handled  we 
continue  to  the  next  event  point. 

When  the  event  point  is  an  intersection,  the  two  segments  that  intersect 
change  their  order.  Each  of  them  gets  (at  most)  one  new  neighbor  against  which 
it  is  tested  for  intersection.  Again,  only  intersections  below  the  sweep  line  are 
still  interesting.  Suppose  that  four  segments  Sj ,  Sk,  si,  and  sm  appear  in  this 
order  on  the  sweep  line  when  the  intersection  point  of  Sk  and  s /  is  reached.  Then 
Sk  and  s/  switch  position  and  we  must  test  .sy  and  sj  for  intersection  below  the 
sweep  line,  and  also  Sk  and  sm.  The  new  intersections  that  we  find  are,  of  course, 
also  event  points  for  the  algorithm.  Note,  however,  that  it  is  possible  that  these 
events  have  already  been  detected  earlier,  namely  if  a  pair  becoming  adjacent 
has  been  adjacent  before. 
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When  the  event  point  is  the  lower  endpoint  of  a  segment,  its  two  neighbors 
now  become  adjacent  and  must  be  tested  for  intersection.  If  they  intersect  below 
the  sweep  line,  then  their  intersection  point  is  an  event  point.  (Again,  this  event 
could  have  been  detected  already.)  Assume  three  segments  s^,  s/,  and  sm  appear 
in  this  order  on  the  sweep  line  when  the  lower  endpoint  of  .sy  is  encountered. 
Then  Sk  and  sm  will  become  adjacent  and  we  test  them  for  intersection. 

After  we  have  swept  the  whole  plane — more  precisely,  after  we  have  treated 
the  last  event  point — we  have  computed  all  intersection  points.  This  is  guaran¬ 
teed  by  the  following  invariant,  which  holds  at  any  time  during  the  plane  sweep: 
all  intersection  points  above  the  sweep  line  have  been  computed  correctly. 


After  this  sketch  of  the  algorithm,  it’s  time  to  go  into  more  detail.  It’s  also 
time  to  look  at  the  degenerate  cases  that  can  arise,  like  three  or  more  segments 
meeting  in  a  point.  We  should  first  specify  what  we  expect  from  the  algorithm 
in  these  cases.  We  could  require  the  algorithm  to  simply  report  each  intersection 
point  once,  but  it  seems  more  useful  if  it  reports  for  each  intersection  point  a 
list  of  segments  that  pass  through  it  or  have  it  as  an  endpoint.  There  is  another 
special  case  for  which  we  should  define  the  required  output  more  carefully, 
namely  that  of  two  partially  overlapping  segments,  but  for  simplicity  we  shall 
ignore  this  case  in  the  rest  of  this  section. 
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We  start  by  describing  the  data  structures  the  algorithm  uses. 

First  of  all  we  need  a  data  structure — called  the  event  queue — that  stores  the 
events.  We  denote  the  event  queue  by  Q.  We  need  an  operation  that  removes  the 
next  event  that  will  occur  from  Q,  and  returns  it  so  that  it  can  be  treated.  This 
event  is  the  highest  event  below  the  sweep  line.  If  two  event  points  have  the  same 
y-coordinate,  then  the  one  with  smaller  .v-coordinate  will  be  returned.  In  other 
words,  event  points  on  the  same  horizontal  line  are  treated  from  left  to  right. 
This  implies  that  we  should  consider  the  left  endpoint  of  a  horizontal  segment 
to  be  its  upper  endpoint,  and  its  right  endpoint  to  be  its  lower  endpoint.  You 
can  also  think  about  our  convention  as  follows:  instead  of  having  a  horizontal 
sweep  line,  imagine  it  is  sloping  just  a  tiny  bit  upward.  As  a  result  the  sweep 
line  reaches  the  left  endpoint  of  a  horizontal  segment  just  before  reaching  the 
right  endpoint.  The  event  queue  must  allow  insertions,  because  new  events  will 
be  computed  on  the  fly.  Notice  that  two  event  points  can  coincide.  For  example, 
the  upper  endpoints  of  two  distinct  segments  may  coincide.  It  is  convenient  to 
treat  this  as  one  event  point.  Hence,  an  insertion  must  be  able  to  check  whether 
an  event  is  already  present  in  Q. 

We  implement  the  event  queue  as  follows.  Define  an  order  -<  on  the  event 
points  that  represents  the  order  in  which  they  will  be  handled.  Hence,  if  p  and  q 
are  two  event  points  then  we  have  p  -<  q  if  and  only  if  py  >  qy  holds  or  py  =  qy 
and  px  <  qx  holds.  We  store  the  event  points  in  a  balanced  binary  search  tree, 
ordered  according  to  -< .  With  each  event  point  p  in  Q  we  will  store  the  segments 
starting  at  p,  that  is,  the  segments  whose  upper  endpoint  is  p.  This  information 
will  be  needed  to  handle  the  event.  Both  operations — fetching  the  next  event 
and  inserting  an  event — take  O(\ogm)  time,  where  m  is  the  number  of  events 


in  Q.  (We  do  not  use  a  heap  to  implement  the  event  queue,  because  we  have  to 
be  able  to  test  whether  a  given  event  is  already  present  in  Q.) 

Second,  we  need  to  maintain  the  status  of  the  algorithm.  This  is  the  ordered 
sequence  of  segments  intersecting  the  sweep  line.  The  status  structure,  denoted 
by  T,  is  used  to  access  the  neighbors  of  a  given  segment  s ,  so  that  they  can  be 
tested  for  intersection  with  s.  The  status  structure  must  be  dynamic:  as  segments 
start  or  stop  to  intersect  the  sweep  line,  they  must  be  inserted  into  or  deleted 
from  the  structure.  Because  there  is  a  well-defined  order  on  the  segments  in 
the  status  structure  we  can  use  a  balanced  binary  search  tree  as  status  structure. 
When  you  are  only  used  to  binary  search  trees  that  store  numbers,  this  may  be 
surprising.  But  binary  search  trees  can  store  any  set  of  elements,  as  long  as 
there  is  an  order  on  the  elements. 

In  more  detail,  we  store  the  segments  intersecting  the  sweep  line  ordered 
in  the  leaves  of  a  balanced  binary  search  tree  T.  The  left-to-right  order  of 
the  segments  along  the  sweep  line  corresponds  to  the  left-to-right  order  of  the 
leaves  in  T.  We  must  also  store  information  in  the  internal  nodes  to  guide  the 
search  down  the  tree  to  the  leaves.  At  each  internal  node,  we  store  the  segment 
from  the  rightmost  leaf  in  its  left  subtree.  (Alternatively,  we  could  store  the 
segments  only  in  interior  nodes.  This  will  save  some  storage.  However,  it  is 
conceptually  simpler  to  think  about  the  segments  in  interior  nodes  as  values 
to  guide  the  search,  not  as  data  items.  Storing  the  segments  in  the  leaves  also 
makes  some  algorithms  simpler  to  describe.)  Suppose  we  search  in  T  for  the 
segment  immediately  to  the  left  of  some  point  p  that  lies  on  the  sweep  line.  At 
each  internal  node  v  we  test  whether  p  lies  left  or  right  of  the  segment  stored 
at  v.  Depending  on  the  outcome  we  descend  to  the  left  or  right  subtree  of  v, 
eventually  ending  up  in  a  leaf.  Either  this  leaf,  or  the  leaf  immediately  to  the  left 
of  it,  stores  the  segment  we  are  searching  for.  In  a  similar  way  we  can  find  the 
segment  immediately  to  the  right  of  p,  or  the  segments  containing  p.  It  follows 
that  each  update  and  neighbor  search  operation  takes  <9(log«)  time. 

The  event  queue  Q  and  the  status  structure  T  are  the  only  two  data  structures 
we  need.  The  global  algorithm  can  now  be  described  as  follows. 
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Algorithm  FlNDlNTERSECTIONS(S) 

Input.  A  set  S  of  line  segments  in  the  plane. 

Output.  The  set  of  intersection  points  among  the  segments  in  5,  with  for  each 
intersection  point  the  segments  that  contain  it. 

1.  Initialize  an  empty  event  queue  Q.  Next,  insert  the  segment  endpoints  into 
Q;  when  an  upper  endpoint  is  inserted,  the  corresponding  segment  should 
be  stored  with  it. 

2.  Initialize  an  empty  status  structure  T. 

3.  while  Q  is  not  empty 

4.  do  Determine  the  next  event  point  p  in  Q  and  delete  it. 

5.  HandleEventPoint(p) 

We  have  already  seen  how  events  are  handled:  at  endpoints  of  segments  we 
have  to  insert  or  delete  segments  from  the  status  structure  T,  and  at  intersection 

points  we  have  to  change  the  order  of  two  segments.  In  both  cases  we  also  _ 

have  to  do  intersection  tests  between  segments  that  become  neighbors  after  the  25 
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event.  In  degenerate  cases — where  several  segments  are  involved  in  one  event 
point — the  details  are  a  little  bit  more  tricky.  The  next  procedure  describes  how 
to  handle  event  points  correctly;  it  is  illustrated  in  Figure  2.2. 


Figure  2.2 

An  event  point  and  the  changes  in  the 
status  structure 
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HandleEventPoint(  p) 

1 .  Let  U  ( p )  be  the  set  of  segments  whose  upper  endpoint  is  p\  these  segments 
are  stored  with  the  event  point  p.  (For  horizontal  segments,  the  upper 
endpoint  is  by  definition  the  left  endpoint.) 

2.  Find  all  segments  stored  in  T  that  contain  p\  they  are  adjacent  in  'T.  Let 
L(p)  denote  the  subset  of  segments  found  whose  lower  endpoint  is  p,  and 
let  C(p)  denote  the  subset  of  segments  found  that  contain  p  in  their  interior. 

3.  if  L(p)  U  U(p)  U  C(p)  contains  more  than  one  segment 

4.  then  Report  p  as  an  intersection,  together  with  Up).  U(p),  and  C(p). 

5.  Delete  the  segments  in  L(p)  U  C(p)  from  T. 

6.  Insert  the  segments  in  U  (p)  U  C(p)  into  7.  The  order  of  the  segments  in  T 
should  correspond  to  the  order  in  which  they  are  intersected  by  a  sweep 
line  just  below  p.  If  there  is  a  horizontal  segment,  it  comes  last  among  all 
segments  containing  p. 

7.  (*  Deleting  and  re-inserting  the  segments  of  C(p)  reverses  their  order.  *) 

8.  ifU(p)UC(p)  =  (D 

9.  then  Let  ,sy  and  ,v,  be  the  left  and  right  neighbors  of  p  in  7. 

10.  FlNDNEWEVENT(s;,Sr,^) 

1 1 .  else  Let  s'  be  the  leftmost  segment  of  U (p)  U  C(p)  in  T. 

12.  Let  si  be  the  left  neighbor  of  s'  in  7. 

13.  FindNewEvent(s;,s',^) 

14.  Let  s"  be  the  rightmost  segment  of  U (p)  UC(p)  in  7. 

15.  Let  sr  be  the  right  neighbor  of  s"  in  T. 

16.  FindNewEvent(s,/  ,  sr ,  p) 

_  Note  that  in  lines  8-16  we  assume  that  si  and  sr  actually  exist.  If  they  do  not 

26  exist  the  corresponding  steps  should  obviously  not  be  performed. 


The  procedures  for  finding  the  new  intersections  are  easy:  they  simply  test 
two  segments  for  intersection.  The  only  thing  we  need  to  be  careful  about  is, 
when  we  find  an  intersection,  whether  this  intersection  has  already  been  handled 
earlier  or  not.  When  there  are  no  horizontal  segments,  then  the  intersection 
has  not  been  handled  yet  when  the  intersection  point  lies  below  the  sweep  line. 
But  how  should  we  deal  with  horizontal  segments?  Recall  our  convention  that 
events  with  the  same  y-coordinate  are  treated  from  left  to  right.  This  implies 
that  we  are  still  interested  in  intersection  points  lying  to  the  right  of  the  current 
event  point.  Hence,  the  procedure  FindNewEvent  is  defined  as  follows. 

FindN  ewE  VENT(i/ ,  Sr ,  p ) 

1 .  if  si  and  sr  intersect  below  the  sweep  line,  or  on  it  and  to  the  right  of  the 

current  event  point  p,  and  the  intersection  is  not  yet  present  as  an 
event  in  Q 

2.  then  Insert  the  intersection  point  as  an  event  into  Q. 


What  about  the  correctness  of  our  algorithm?  It  is  clear  that  FlNDlNTERSEC- 
TIONS  only  reports  true  intersection  points,  but  does  it  find  all  of  them?  The 
next  lemma  states  that  this  is  indeed  the  case. 

Lemma  2.2  Algorithm  FlNDlNTERSECTlONS  computes  all  intersection  points 
and  the  segments  that  contain  it  correctly. 

Proof.  Recall  that  the  priority  of  an  event  is  given  by  its  y-coordinate,  and  that 
when  two  events  have  the  same  y-coordinate  the  one  with  smaller  x-coordinate 
is  given  higher  priority.  We  shall  prove  the  lemma  by  induction  on  the  priority 
of  the  event  points. 

Let  p  be  an  intersection  point  and  assume  that  all  intersection  points  q  with 
a  higher  priority  have  been  computed  correctly.  We  shall  prove  that  p  and 
the  segments  that  contain  p  are  computed  correctly.  Let  U  (p)  be  the  set  of 
segments  that  have  p  as  their  upper  endpoint  (or,  for  horizontal  segments,  their 
left  endpoint),  let  Lip)  be  the  set  of  segments  having  p  as  their  lower  endpoint 
(or,  for  horizontal  segments,  their  right  endpoint),  and  let  C(p)  be  the  set  of 
segments  having  p  in  their  interior. 

First,  assume  that  p  is  an  endpoint  of  one  or  more  of  the  segments.  In  that 
case  p  is  stored  in  the  event  queue  Q  at  the  start  of  the  algorithm.  The  segments 
from  U  (p)  are  stored  with  p,  so  they  will  be  found.  The  segments  from  Lip) 
and  C(p)  are  stored  in  T  when  p  is  handled,  so  they  will  be  found  in  line  2  of 
HandleEventPoint.  Hence,  p  and  all  the  segments  involved  are  determined 
correctly  when  p  is  an  endpoint  of  one  or  more  of  the  segments. 

Now  assume  that  p  is  not  an  endpoint  of  a  segment.  All  we  need  to  show  is 
that  p  will  be  inserted  into  Q  at  some  moment.  Note  that  all  segments  that  are 
involved  have  p  in  their  interior.  Order  these  segments  by  angle  around  p,  and 
let  s,  and  sj  be  two  neighboring  segments.  Following  the  proof  of  Lemma  2.1 
we  see  that  there  is  an  event  point  with  a  higher  priority  than  p  such  that  Sj  and 
sj  become  adjacent  when  q  is  passed.  In  Lemma  2.1  we  assumed  for  simplicity 
that  Sj  and  Sj  are  non-horizontal,  but  it  is  straightforward  to  adapt  the  proof  for 
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Chapter  2  horizontal  segments.  By  induction,  the  event  point  q  was  handled  correctly, 

line  segment  intersection  which  means  that  p  is  detected  and  stored  into  Q.  ED 

So  we  have  a  correct  algorithm.  But  did  we  succeed  in  developing  an  output- 
sensitive  algorithm?  The  answer  is  yes:  the  running  time  of  the  algorithm  is 
0((n  +  k)  logn),  where  k  is  the  size  of  the  output.  The  following  lemma  states 
an  even  stronger  result:  the  running  time  is  0((n  +  1)  logn),  where  I  is  the 
number  of  intersections.  This  is  stronger,  because  for  one  intersection  point  the 
output  can  consist  of  a  large  number  of  segments,  namely  in  the  case  where 
many  segments  intersect  in  a  common  point. 

Lemma  2.3  The  running  time  of  Algorithm  FlNDlNTERSECTlONS  for  a  set  S 
ofn  line  segments  in  the  plane  is  0(n  log/;  +  /  log/;),  where  I  is  the  number  of 
intersection  points  of  segments  in  S. 

Proof.  The  algorithm  starts  by  constructing  the  event  queue  on  the  segment 
endpoints.  Because  we  implemented  the  event  queue  as  a  balanced  binary 
search  tree,  this  takes  0(nlogn)  time.  Initializing  the  status  structure  takes 
constant  time.  Then  the  plane  sweep  starts  and  all  the  events  are  handled.  To 
handle  an  event  we  perform  three  operations  on  the  event  queue  Q:  the  event 
itself  is  deleted  from  Q  in  line  4  of  FlNDlNTERSECTlONS,  and  there  can  be  one 
or  two  calls  to  FindNewEvent,  which  may  cause  at  most  two  new  events  to 
be  inserted  into  Q.  Deletions  and  insertions  on  Q  take  O(logn)  time  each.  We 
also  perform  operations — insertions,  deletions,  and  neighbor  finding — on  the 
status  structure  7,  which  take  0(log«)  time  each.  The  number  of  operations 
is  linear  in  the  number  m(p)  :=  card (L(p)  UU(p)  U  C(p))  of  segments  that  are 
involved  in  the  event.  If  we  denote  the  sum  of  all  m(p),  over  all  event  points  p, 
by  m,  the  running  time  of  the  algorithm  is  Ofnlogn). 

It  is  clear  that  m  —  Ofn  +k),  where  k  is  the  size  of  the  output;  after  all. 
whenever  m(p)  >  1  we  report  all  segments  involved  in  the  event,  and  the  only 
events  involving  one  segment  are  the  endpoints  of  segments.  But  we  want 
to  prove  that  m  =  0(n  +  /),  where  I  is  the  number  of  intersection  points.  To 
show  this,  we  will  interpret  the  set  of  segments  as  a  planar  graph  embedded  in 
the  plane.  (If  you  are  not  familiar  with  planar  graph  terminology,  you  should 
read  the  first  paragraphs  of  Section  2.2  first.)  Its  vertices  are  the  endpoints  of 
segments  and  intersection  points  of  segments,  and  its  edges  are  the  pieces  of 
the  segments  connecting  vertices.  Consider  an  event  point  p.  It  is  a  vertex  of 
the  graph,  and  m(p)  is  bounded  by  the  degree  of  the  vertex.  Consequently,  m  is 
bounded  by  the  sum  of  the  degrees  of  all  vertices  of  our  graph.  Every  edge  of 
the  graph  contributes  one  to  the  degree  of  exactly  two  vertices  (its  endpoints), 
so  m  is  bounded  by  2 ne,  where  ne  is  the  number  of  edges  of  the  graph.  Let’s 
bound  ne  in  terms  of  n  and  I.  By  definition,  nv,  the  number  of  vertices,  is  at 
most  2 n+I.  It  is  well  known  that  in  planar  graphs  ne  =  0(nv),  which  proves  our 
claim.  But,  for  completeness,  let  us  give  the  argument  here.  Every  face  of  the 
planar  graph  is  bounded  by  at  least  three  edges — provided  that  there  are  at  least 
three  segments — and  an  edge  can  bound  at  most  two  different  faces.  Therefore 
n/,  the  number  of  faces,  is  at  most  2ne/3.  We  now  use  Euler’s  formula,  which 
states  that  for  any  planar  graph  with  nv  vertices,  ne  edges,  and  n j  faces,  the 
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following  relation  holds: 
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nv  —  ne  +  rif  ^  2. 

Equality  holds  if  and  only  if  the  graph  is  connected.  Plugging  the  bounds  on  nv 
and  n j  into  this  formula,  we  get 

2^2 

2  ^  (2 n  +/)  —  ne  -\ — =  (2 n  +  I)  —  nej 3. 

So  ne  bn  I  3 /  6,  and  ///  ^  12//  — j—  1/ /  12,  and  the  bound  on  the  running  time 

follows.  0 

We  still  have  to  analyze  the  other  complexity  aspect,  the  amount  of  storage 
used  by  the  algorithm.  The  tree  T  stores  a  segment  at  most  once,  so  it  uses  0(n) 
storage.  The  size  of  Q  can  be  larger,  however.  The  algorithm  inserts  intersection 
points  in  Q  when  they  are  detected  and  it  removes  them  when  they  are  handled 
When  it  takes  a  long  time  before  intersections  are  handled,  it  could  happen  that 
Q  gets  very  large.  Of  course  its  size  is  always  bounded  by  0(n  +/),  but  it  would 
be  better  if  the  working  storage  were  always  linear. 

There  is  a  relatively  simple  way  to  achieve  this:  only  store  intersection 
points  of  pairs  of  segments  that  are  currently  adjacent  on  the  sweep  line.  The 
algorithm  given  above  also  stores  intersection  points  of  segments  that  have 
been  horizontally  adjacent,  but  aren’t  anymore.  By  storing  only  intersections 
among  adjacent  segments,  the  number  of  event  points  in  Q  is  never  more  than 
linear.  The  modification  required  in  the  algorithm  is  that  the  intersection  point 
of  two  segments  must  be  deleted  when  they  stop  being  adjacent.  These  segments 
must  become  adjacent  again  before  the  intersection  point  is  reached,  so  the 
intersection  point  will  still  be  reported  correctly.  The  total  time  taken  by  the 
algorithm  remains  0(n\ogn  +  /log«).  We  obtain  the  following  theorem: 

Theorem  2.4  Let  S  be  a  set  of  n  line  segments  in  the  plane.  All  intersection 
points  in  S,  with  for  each  intersection  point  the  segments  involved  in  it,  can  be 
reported  in  O(n\ogn  +  /logn)  time  and  0(n)  space,  where  I  is  the  number  of 
intersection  points. 


2.2  The  Doubly-Connected  Edge  List 

We  have  solved  the  easiest  case  of  the  map  overlay  problem,  where  the  two 
maps  are  networks  represented  as  collections  of  line  segments.  In  general, 
maps  have  a  more  complicated  structure:  they  are  subdivisions  of  the  plane  into 
labeled  regions.  A  thematic  map  of  forests  in  Canada,  for  instance,  would  be 
a  subdivision  of  Canada  into  regions  with  labels  such  as  “pine”,  “deciduous”, 
“birch”,  and  “mixed”. 

Before  we  can  give  an  algorithm  for  computing  the  overlay  of  two  subdivi¬ 
sions,  we  must  develop  a  suitable  representation  for  a  subdivision.  Storing  a 
subdivision  as  a  collection  of  line  segments  is  not  such  a  good  idea.  Operations 
like  reporting  the  boundary  of  a  region  would  be  rather  complicated.  It  is  better 
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to  incorporate  structural,  topological  information:  which  segments  bound  a 
given  region,  which  regions  are  adjacent,  and  so  on. 


The  maps  we  consider  are  planar  subdivisions  induced  by  planar  embeddings 
of  graphs.  Such  a  subdivision  is  connected  if  the  underlying  graph  is  connected. 
The  embedding  of  a  node  of  the  graph  is  called  a  vertex,  and  the  embedding  of 
an  arc  is  called  an  edge.  We  only  consider  embeddings  where  every  edge  is  a 
straight  line  segment.  In  principle,  edges  in  a  subdivision  need  not  be  straight. 
A  subdivision  need  not  even  be  a  planar  embedding  of  a  graph,  as  it  may  have 
unbounded  edges.  In  this  section,  however,  we  don't  consider  such  more  general 
subdivisions.  We  consider  an  edge  to  be  open,  that  is,  its  endpoints — which  are 
vertices  of  the  subdivision — are  not  part  of  it.  A  face  of  the  subdivision  is  a 
maximal  connected  subset  of  the  plane  that  doesn’t  contain  a  point  on  an  edge 
or  a  vertex.  Thus  a  face  is  an  open  polygonal  region  whose  boundary  is  formed 
by  edges  and  vertices  from  the  subdivision.  The  complexity  of  a  subdivision 
is  defined  as  the  sum  of  the  number  of  vertices,  the  number  of  edges,  and  the 
number  of  faces  it  consists  of.  If  a  vertex  is  the  endpoint  of  an  edge,  then  we 
say  that  the  vertex  and  the  edge  are  incident.  Similarly,  a  face  and  an  edge  on 
its  boundary  are  incident,  and  a  face  and  a  vertex  of  its  boundary  are  incident. 

What  should  we  require  from  a  representation  of  a  subdivision?  An  opera¬ 
tion  one  could  ask  for  is  to  determine  the  face  containing  a  given  point.  This 
is  definitely  useful  in  some  applications — indeed,  in  a  later  chapter  we  shall 
design  a  data  structure  for  this — but  it  is  a  bit  too  much  to  ask  from  a  basic 
representation.  The  things  we  can  ask  for  should  be  more  local.  For  example,  it 
is  reasonable  to  require  that  we  can  walk  around  the  boundary  of  a  given  face, 
or  that  we  can  access  one  face  from  an  adjacent  one  if  we  are  given  a  common 
edge.  Another  operation  that  could  be  useful  is  to  visit  all  the  edges  around  a 
given  vertex.  The  representation  that  we  shall  discuss  supports  these  operations. 
It  is  called  the  doubly-connected  edge  list. 
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A  doubly-connected  edge  list  contains  a  record  for  each  face,  edge,  and  vertex 


of  the  subdivision.  Besides  the  geometric  and  topological  information — to 
be  described  shortly — each  record  may  also  store  additional  information.  For 
instance,  if  the  subdivision  represents  a  thematic  map  for  vegetation,  the  doubly- 
connected  edge  list  would  store  in  each  face  record  the  type  of  vegetation  of 
the  corresponding  region.  The  additional  information  is  also  called  attribute 
information.  The  geometric  and  topological  information  stored  in  the  doubly- 
connected  edge  list  should  enable  us  to  perform  the  basic  operations  mentioned 
earlier.  To  be  able  to  walk  around  a  face  in  counterclockwise  order  we  store  a 
pointer  from  each  edge  to  the  next.  It  can  also  come  in  handy  to  walk  around 
a  face  the  other  way,  so  we  also  store  a  pointer  to  the  previous  edge.  An  edge 
usually  bounds  two  faces,  so  we  need  two  pairs  of  pointers  for  it.  It  is  convenient 
to  view  the  different  sides  of  an  edge  as  two  distinct  half-edges,  so  that  we  have 
a  unique  next  half-edge  and  previous  half-edge  for  every  half-edge.  This  also 
means  that  a  half-edge  bounds  only  one  face.  The  two  half-edges  we  get  for  a 
given  edge  are  called  twins.  Defining  the  next  half-edge  of  a  given  half-edge 
with  respect  to  a  counterclockwise  traversal  of  a  face  induces  an  orientation  on 
each  half-edge:  it  is  oriented  such  that  the  face  that  it  bounds  lies  to  its  left  for 
an  observer  walking  along  the  edge.  Because  half-edges  are  oriented  we  can 
speak  of  the  origin  and  the  destination  of  a  half-edge.  If  a  half-edge  e  has  v  as  its 
origin  and  w  as  its  destination,  then  its  twin  Twinfe )  has  w  as  its  origin  and  v  as 
its  destination.  To  reach  the  boundary  of  a  face  we  just  need  to  store  one  pointer 
in  the  face  record  to  an  arbitrary  half-edge  bounding  the  face.  Starting  from 
that  half-edge,  we  can  step  from  each  half-edge  to  the  next  and  walk  around  the 
face. 

What  we  just  said  does  not  quite  hold  for  the  boundaries  of  holes  in  a  face: 
if  they  are  traversed  in  counterclockwise  order  then  the  face  lies  to  the  right.  It 
will  be  convenient  to  orient  half-edges  such  that  their  face  always  lies  to  the 
same  side,  so  we  change  the  direction  of  traversal  for  the  boundary  of  a  hole  to 
clockwise.  Now  a  face  always  lies  to  the  left  of  any  half-edge  on  its  boundary. 
Another  consequence  is  that  twin  half-edges  always  have  opposite  orientations. 
The  presence  of  holes  in  a  face  also  means  that  one  pointer  from  the  face  to  an 
arbitrary  half-edge  on  its  boundary  is  not  enough  to  visit  the  whole  boundary: 
we  need  a  pointer  to  a  half-edge  in  every  boundary  component.  If  a  face  has 
isolated  vertices  that  don’t  have  any  incident  edge,  we  can  store  pointers  to  them 
as  well.  For  simplicity  we’ll  ignore  this  case. 

Let’s  summarize.  The  doubly-connected  edge  list  consists  of  three  collections 
of  records:  one  for  the  vertices,  one  for  the  faces,  and  one  for  the  half-edges. 
These  records  store  the  following  geometric  and  topological  information: 
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■  The  vertex  record  of  a  vertex  v  stores  the  coordinates  of  v  in  a  field  called 
Coordinates(y) .  It  also  stores  a  pointer  IncidentEdge(y)  to  an  arbitrary 
half-edge  that  has  v  as  its  origin. 


■  The  face  record  of  a  face  /  stores  a  pointer  OuterComponent(f)  to  some 
half-edge  on  its  outer  boundary.  For  the  unbounded  face  this  pointer  is  nil. 
It  also  stores  a  list  InnerComponents(f),  which  contains  for  each  hole  in 
the  face  a  pointer  to  some  half-edge  on  the  boundary  of  the  hole. 
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Origin  (e) 


m  The  half-edge  record  of  a  half-edge  e  stores  a  pointer  Origin(e )  to  its  origin, 
a  pointer  Twin(e)  to  its  twin  half-edge,  and  a  pointer  IncidentFace(e)  to 
the  face  that  it  bounds.  We  don’t  need  to  store  the  destination  of  an  edge, 
because  it  is  equal  to  Origin(Twin(e)) .  The  origin  is  chosen  such  that 
IncidenlFace{e)  lies  to  the  left  of  e  when  it  is  traversed  from  origin  to 
destination.  The  half-edge  record  also  stores  pointers  Next(e)  and  Prev(e ) 
to  the  next  and  previous  edge  on  the  boundary  of  Incident  Face  (e) .  Thus 
Next(e)  is  the  unique  half-edge  on  the  boundary  of  IncidentFace(e)  that  has 
the  destination  of  e  as  its  origin,  and  Prev(e)  is  the  unique  half-edge  on  the 
boundary  of  Incident  Face(e)  that  has  Origin(e )  as  its  destination. 

A  constant  amount  of  information  is  used  for  each  vertex  and  edge.  A  face  may 
require  more  storage,  since  the  list  InnerComponents(f)  has  as  many  elements 
as  there  are  holes  in  the  face.  Because  any  half-edge  is  pointed  to  at  most  once 
from  all  InnerComponents(f)  lists  together,  we  conclude  that  the  amount  of 
storage  is  linear  in  the  complexity  of  the  subdivision.  An  example  of  a  doubly- 
connected  edge  list  for  a  simple  subdivision  is  given  below.  The  two  half-edges 
corresponding  to  an  edge  e,  are  labeled  e,  j  and  e,2- 


Vertex 

Coordinates 

IncidentEdge 

i'i 

(0,4) 

et,i 

V'2 

(2,4) 

64,2 

V3 

(2,2) 

62,1 

V4 

(1,1) 

62,2 

Face 

OuterComponent 

InnerComponents 

nil 

ei,i 

f2 

64,1 

nil 

Half-edge 

Origin 

Twin 

IncidentFace 

Next 

Prev 

ei,i 

Vl 

6 1,2 

fi 

64,2 

63,1 

61,2 

V2 

e  1,1 

f2 

63,2 

64,1 

ei,i 

V3 

62,2 

h 

62,2 

64,2 

62,2 

V4 

62,1 

h 

63,1 

62,1 

63,1 

V3 

63,2 

h 

61,1 

62,2 

63,2 

Vl 

63,1 

f2 

64,1 

6 1,2 

64,1 

V3 

64,2 

f2 

6 1,2 

63,2 

64,2 

V2 

64,1 

h 

62,1 

61,1 

The  information  stored  in  the  doubly-connected  edge  list  is  enough  to  perform 
the  basic  operations.  For  example,  we  can  walk  around  the  outer  boundary 
of  a  given  face  /  by  following  Next(e)  pointers,  starting  from  the  half-edge 
OuterComponent(f).  We  can  also  visit  all  edges  incident  to  a  vertex  v.  It  is  a 
good  exercise  to  figure  out  for  yourself  how  to  do  this. 


We  described  a  fairly  general  version  of  the  doubly-connected  edge  list.  In 
applications  where  the  vertices  carry  no  attribute  information  we  could  store 


their  coordinates  directly  in  the  Origin ()  field  of  the  edge;  there  is  no  strict  need 
for  a  separate  type  of  vertex  record.  Even  more  important  is  to  realize  that  in 
many  applications  the  faces  of  the  subdivision  carry  no  interesting  meaning 
(think  of  the  network  of  rivers  or  roads  that  we  looked  at  before).  If  that  is  the 
case,  we  can  completely  forget  about  the  face  records,  and  the  IncidentFace () 
field  of  half-edges.  As  we  will  see,  the  algorithm  of  the  next  section  doesn’t 
need  these  fields  (and  is  actually  simpler  to  implement  if  we  don’t  need  to 
update  them).  Some  implementations  of  doubly-connected  edge  lists  may  also 
insist  that  the  graph  formed  by  the  vertices  and  edges  of  the  subdivision  be 
connected.  This  can  always  be  achieved  by  introducing  dummy  edges,  and 
has  two  advantages.  Firstly,  a  simple  graph  transversal  can  be  used  to  visit  all 
half-edges,  and  secondly,  the  InnerComponents ()  list  for  faces  is  not  necessary. 


2.3  Computing  the  Overlay  of  Two  Subdivisions 

Now  that  we  have  designed  a  good  representation  of  a  subdivision,  we  can  tackle 
the  general  map  overlay  problem.  We  define  the  overlay  of  two  subdivisions  Si 
and  St  to  be  the  subdivision  0(81,82)  such  that  there  is  a  face  /  in  0(81,82) 
if  and  only  if  there  are  faces  f\  in  Si  and  /2  in  82  such  that  /  is  a  maximal 
connected  subset  of  f\  fl  fi-  This  sounds  more  complicated  than  it  is:  what  it 
means  is  that  the  overlay  is  the  subdivision  of  the  plane  induced  by  the  edges 
from  Si  and  82-  Figure  2.4  illustrates  this.  The  general  map  overlay  problem 
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is  to  compute  a  doubly-connected  edge  list  for  0(81,82),  given  the  doubly- 
connected  edge  lists  of  Si  and  82-  We  require  that  each  face  in  0(81,82)  be 
labeled  with  the  labels  of  the  faces  in  Si  and  82  that  contain  it.  This  way  we 
have  access  to  the  attribute  information  stored  for  these  faces.  In  an  overlay  of  a 
vegetation  map  and  a  precipitation  map  this  would  mean  that  we  know  for  each 
region  in  the  overlay  the  type  of  vegetation  and  the  amount  of  precipitation. 

Let’s  first  see  how  much  information  from  the  doubly-connected  edge  lists 
for  Si  and  82  we  can  re-use  in  the  doubly-connected  edge  list  for  0(81,82). 
Consider  the  network  of  edges  and  vertices  of  Si.  This  network  is  cut  into  pieces 
by  the  edges  of  82-  These  pieces  are  for  a  large  part  re-usable;  only  the  edges 
that  have  been  cut  by  the  edges  of  82  should  be  renewed.  But  does  this  also 
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Figure  2.4 

Overlaying  two  subdivisions 
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Chapter  2  hold  for  the  half-edge  records  in  the  doubly-connected  edge  list  that  correspond 

line  segment  intersection  to  the  pieces?  If  the  orientation  of  a  half-edge  would  change,  we  would  still 

have  to  change  the  information  in  these  records.  Fortunately,  this  is  not  the  case. 
The  half-edges  are  oriented  such  that  the  face  that  they  bound  lies  to  the  left; 
the  shape  of  the  face  may  change  in  the  overlay,  but  it  will  remain  to  the  same 
side  of  the  half-edge.  Hence,  we  can  re-use  half-edge  records  corresponding  to 
edges  that  are  not  intersected  by  edges  from  the  other  map.  Stated  differently, 
the  only  half-edge  records  in  the  doubly-connected  edge  list  for  0(§i ,  §2)  that 
we  cannot  borrow  from  Si  or  82  are  the  ones  that  are  incident  to  an  intersection 
between  edges  from  different  maps. 

This  suggests  the  following  approach.  First,  copy  the  doubly-connected 
edge  lists  of  Si  and  82  into  one  new  doubly-connected  edge  list.  The  new 
doubly-connected  edge  list  is  not  a  valid  doubly-connected  edge  list,  of  course, 
in  the  sense  that  it  does  not  yet  represent  a  planar  subdivision.  This  is  the  task 
of  the  overlay  algorithm:  it  must  transform  the  doubly-connected  edge  list  into 
a  valid  doubly-connected  edge  list  for  0(8i ,  82)  by  computing  the  intersections 
between  the  two  networks  of  edges,  and  linking  together  the  appropriate  parts 
of  the  two  doubly-connected  edge  lists. 

We  did  not  talk  about  the  new  face  records  yet.  The  information  for  these 
records  is  more  difficult  to  compute,  so  we  leave  this  for  later.  We  first  describe 
in  a  little  more  detail  how  the  vertex  and  half-edge  records  of  the  doubly- 
connected  edge  list  for  0(81,82)  are  computed. 


Our  algorithm  is  based  on  the  plane  sweep  algorithm  of  Section  2.1  for  com¬ 
puting  the  intersections  in  a  set  of  line  segments.  We  run  this  algorithm  on  the 
set  of  segments  that  is  the  union  of  the  sets  of  edges  of  the  two  subdivisions 
Si  and  82-  Here  we  consider  the  edges  to  be  closed.  Recall  that  the  algorithm 
is  supported  by  two  data  structures:  an  event  queue  Q,  which  stores  the  event 
points,  and  the  status  structure  T,  which  is  a  balanced  binary  search  tree  storing 
the  segments  intersecting  the  sweep  line,  ordered  from  left  to  right.  We  now 
also  maintain  a  doubly-connected  edge  list  CD.  Initially,  CD  contains  a  copy 
of  the  doubly-connected  edge  list  for  Si  and  a  copy  of  the  doubly-connected 
edge  list  for  82-  During  the  plane  sweep  we  shall  transform  CD  to  a  correct 
doubly-connected  edge  list  for  0(81,82).  That  is  to  say,  as  far  as  the  vertex 
and  half-edge  records  are  concerned;  the  face  information  will  be  computed 
later.  We  keep  cross  pointers  between  the  edges  in  the  status  structure  CT  and 
the  half-edge  records  in  CD  that  correspond  to  them.  This  way  we  can  access  the 
part  of  CD  that  needs  to  be  changed  when  we  encounter  an  intersection  point. 
The  invariant  that  we  maintain  is  that  at  any  time  during  the  sweep,  the  part  of 
the  overlay  above  the  sweep  line  has  been  computed  correctly. 

Now,  let’s  consider  what  we  must  do  when  we  reach  an  event  point.  First 
of  all,  we  update  CT  and  Q  as  in  the  line  segment  intersection  algorithm.  If  the 
event  involves  only  edges  from  one  of  the  two  subdivisions,  this  is  all;  the  event 
point  is  a  vertex  that  can  be  re-used.  If  the  event  involves  edges  from  both 
subdivisions,  we  must  make  local  changes  to  CD  to  link  the  doubly-connected 
edge  lists  of  the  two  original  subdivisions  at  the  intersection  point.  This  is 
tedious  but  not  difficult. 


the  geometric  situation  and  the 
two  doubly-connected  edge  lists 
before  handling  the  intersection 


the  doubly-connected  edge  list 
after  handling  the  intersection 
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We  describe  the  details  for  one  of  the  possible  cases,  namely  when  an  edge 
e  of  §1  passes  through  a  vertex  v  of  §2,  see  Figure  2.5.  The  edge  e  must  be 
replaced  by  two  edges  denoted  e'  and  e" .  In  the  doubly-connected  edge  list,  the 
two  half-edges  for  e  must  become  four.  We  create  two  new  half-edge  records, 
both  with  v  as  the  origin.  The  two  existing  half-edges  for  e  keep  the  endpoints  of 
e  as  their  origin,  as  shown  in  Figure  2.5.  Then  we  pair  up  the  existing  half-edges 
with  the  new  half-edges  by  setting  their  TwinQ)  pointers.  So  e'  is  represented 
by  one  new  and  one  existing  half-edge,  and  the  same  holds  for  e" .  Now  we 
must  set  a  number  of  PrevQ)  and  NextQ)  pointers.  We  first  deal  with  the  situation 
around  the  endpoints  of  e\  later  we’ll  worry  about  the  situation  around  v.  The 
NextQ)  pointers  of  the  two  new  half-edges  each  copy  the  NextQ)  pointer  of  the 
old  half-edge  that  is  not  its  twin.  The  half-edges  to  which  these  pointers  point 
must  also  update  their  PrevQ  pointer  and  set  it  to  the  new  half-edges.  The 
correctness  of  this  step  can  be  verified  best  by  looking  at  a  figure. 

It  remains  to  correct  the  situation  around  vertex  v.  We  must  set  the  NextQ 
and  PrevQ)  pointers  of  the  four  half-edges  representing  e'  and  e" ,  and  of  the  four 
half-edges  incident  from  82  to  v.  We  locate  these  four  half-edges  from  82  by 
testing  where  e'  and  e"  should  be  in  the  cyclic  order  of  the  edges  around  vertex 
v.  There  are  four  pairs  of  half-edges  that  become  linked  by  a  NextQ  pointer 
from  the  one  and  a  PrevQ)  pointer  from  the  other.  Consider  the  half-edge  for 
e'  that  has  v  as  its  destination.  It  must  be  linked  to  the  first  half-edge,  seen 
clockwise  from  <?',  with  v  as  its  origin.  The  half-edge  for  e'  with  v  as  its  origin 
must  be  linked  to  the  first  counterclockwise  half-edge  with  v  as  its  destination. 
The  same  statements  hold  for  e" . 

Most  of  the  steps  in  the  description  above  take  only  constant  time.  Only 
locating  where  e'  and  e"  appear  in  the  cyclic  order  around  v  may  take  longer: 
it  will  take  time  linear  in  the  degree  of  v.  The  other  cases  that  can  arise — 
crossings  of  two  edges  from  different  maps,  and  coinciding  vertices — are  not 
more  difficult  than  the  case  we  just  discussed.  These  cases  also  take  time  0(m), 
where  m  is  the  number  of  edges  incident  to  the  event  point.  This  means  that 
updating  D  does  not  increase  the  running  time  of  the  line  segment  intersection 
algorithm  asymptotically.  Notice  that  every  intersection  that  we  find  is  a  vertex 
of  the  overlay.  It  follows  that  the  vertex  records  and  the  half-edge  records  of  the 
doubly-connected  edge  list  for  0(8 1 , 82)  can  be  computed  in  0(nlogn  +  k\ogn) 
time,  where  n  denotes  the  sum  of  the  complexities  of  Si  and  82,  and  k  is  the 
complexity  of  the  overlay. 


Figure  2.5 

An  edge  of  one  subdivision  passing 
through  a  vertex  of  the  other 


first  clockwise  half-edge 
from  e'  with  v  as  its  origin 
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After  the  fields  involving  vertex  and  half-edge  records  have  been  set,  it  remains 
to  compute  the  information  about  the  faces  of  0(8i,§2).  More  precisely,  we 
have  to  create  a  face  record  for  each  face  /  in  0(§i,§2),  we  have  to  make 
OuterComponent(f)  point  to  a  half-edge  on  the  outer  boundary  of  /,  and  we 
have  to  make  a  list  InnerComponents(f)  of  pointers  to  half-edges  on  the  bound¬ 
aries  of  the  holes  inside  /.  Furthermore,  we  must  set  the  IncidentFace ()  fields 
of  the  half-edges  on  the  boundary  of  /  so  that  they  point  to  the  face  record  of  /. 
Finally,  each  of  the  new  faces  must  be  labeled  with  the  names  of  the  faces  in 
the  old  subdivisions  that  contain  it. 

How  many  face  records  will  there  be?  Well,  except  for  the  unbounded  face, 
every  face  has  a  unique  outer  boundary,  so  the  number  of  face  records  we  have 
to  create  is  equal  to  the  number  of  outer  boundaries  plus  one.  From  the  part  of 
the  doubly-connected  edge  list  we  have  constructed  so  far  we  can  easily  extract 
all  boundary  cycles.  But  how  do  we  know  whether  a  cycle  is  an  outer  boundary 
or  the  boundary  of  a  hole  in  a  face?  This  can  be  decided  by  looking  at  the 
leftmost  vertex  v  of  the  cycle,  or,  in  case  of  ties,  at  the  lowest  of  the  leftmost 
ones.  Recall  that  half-edges  are  directed  in  such  a  way  that  their  incident  face 
locally  lies  to  the  left.  Consider  the  two  half-edges  of  the  cycle  that  are  incident 
to  v.  Because  we  know  that  the  incident  face  lies  to  the  left,  we  can  compute 
the  angle  these  two  half-edges  make  inside  the  incident  face.  If  this  angle  is 
smaller  than  180°  then  the  cycle  is  an  outer  boundary,  and  otherwise  it  is  the 
boundary  of  a  hole.  This  property  holds  for  the  leftmost  vertex  of  a  cycle,  but 
not  necessarily  for  other  vertices  of  that  cycle. 

To  decide  which  boundary  cycles  bound  the  same  face  we  construct  a 
graph  3-  For  every  boundary  cycle — inner  and  outer — there  is  a  node  in  3. 
There  is  also  one  node  for  the  imaginary  outer  boundary  of  the  unbounded 
face.  There  is  an  arc  between  two  cycles  if  and  only  if  one  of  the  cycles  is  the 
boundary  of  a  hole  and  the  other  cycle  has  a  half-edge  immediately  to  the  left 
of  the  leftmost  vertex  of  that  hole  cycle.  If  there  is  no  half-edge  to  the  left  of  the 
leftmost  vertex  of  a  cycle,  then  the  node  representing  the  cycle  is  linked  to  the 
node  of  the  unbounded  face.  Figure  2.6  gives  an  example.  The  dotted  segments 
in  the  figure  indicate  the  linking  of  the  hole  cycles  to  other  cycles.  The  graph 
corresponding  to  the  subdivision  is  also  shown  in  the  figure.  The  hole  cycles 
are  shown  as  single  circles,  and  the  outer  boundary  cycles  are  shown  as  double 
circles.  Observe  that  C3  and  C(,  are  in  the  same  connected  component  as  Co. 
This  indicates  that  C3  and  Cf,  are  hole  cycles  in  the  face  whose  outer  boundary 
is  62-  If  there  is  only  one  hole  in  a  face  /,  then  the  graph  3  links  the  boundary 
cycle  of  the  hole  to  the  outer  boundary  of  /.  In  general  this  need  not  be  the  case: 
a  hole  can  also  be  linked  to  another  hole,  as  you  can  see  in  Figure  2.6.  This 
hole,  which  lies  in  the  same  face  /,  may  be  linked  to  the  outer  boundary  of  /, 
or  it  may  be  linked  to  yet  another  hole.  But  eventually  we  must  end  up  linking  a 
hole  to  the  outer  boundary,  as  the  next  lemma  shows. 


Lemma  2.5  Each  connected  component  of  the  graph  3  corresponds  exactly  to 
the  set  of  cycles  incident  to  one  face. 
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Proof.  Consider  a  cycle  C  bounding  a  hole  in  a  face  /.  Because  /  lies  locally 
to  the  left  of  the  leftmost  vertex  of  6,  6  must  be  linked  to  another  cycle  that  also 


Figure  2.6 

A  subdivision  and  the  corresponding 
graph  S 


bounds  /.  It  follows  that  cycles  in  the  same  connected  component  of  9  bound 
the  same  face. 

To  finish  the  proof,  we  show  that  every  cycle  bounding  a  hole  in  /  is  in 
the  same  connected  component  as  the  outer  boundary  of  /.  Suppose  there  is  a 
cycle  for  which  this  is  not  the  case.  Let  C  be  the  leftmost  such  cycle,  that  is,  the 
one  whose  the  leftmost  vertex  is  leftmost.  By  definition  there  is  an  arc  between 
the  6  and  another  cycle  6'  that  lies  partly  to  the  left  of  the  leftmost  vertex  of  C. 
Hence,  C'  is  in  the  same  connected  component  as  C,  which  is  not  the  component 
of  the  outer  boundary  of  /.  This  contradicts  the  definition  of  C.  0 

Lemma  2.5  shows  that  once  we  have  the  graph  9,  we  can  create  a  face  record 
for  every  component.  Then  we  can  set  the  IncidentFace ()  pointers  of  the  half¬ 
edges  that  bound  each  face  /,  and  we  can  construct  the  list  InnerComponents(f) 
and  the  set  OuterComponent(f).  How  can  we  construct  9?  Recall  that  in  the 
plane  sweep  algorithm  for  line  segment  intersection  we  always  looked  for  the 
segments  immediately  to  the  left  of  an  event  point.  (They  had  to  be  tested 
for  intersection  against  the  leftmost  edge  through  the  event  point.)  Hence,  the 
information  we  need  to  construct  9  is  determined  during  the  plane  sweep.  So, 
to  construct  9,  we  first  make  a  node  for  every  cycle.  To  find  the  arcs  of  9, 
we  consider  the  leftmost  vertex  v  of  every  cycle  bounding  a  hole.  If  e  is  the 
half-edge  immediately  left  of  v,  then  we  add  an  arc  between  the  two  nodes 
in  9  representing  the  cycle  containing  e  and  the  hole  cycle  of  which  v  is  the 
leftmost  vertex.  To  find  these  nodes  in  9  efficiently  we  need  pointers  from  every 
half-edge  record  to  the  node  in  9  representing  the  cycle  it  is  in.  So  the  face 
information  of  the  doubly-connected  edge  list  can  be  set  in  0(n  +  k)  additional 
time,  after  the  plane  sweep. 
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One  thing  remains:  each  face  /  in  the  overlay  must  be  labeled  with  the  names  of 
the  faces  in  the  old  subdivisions  that  contained  it.  To  find  these  faces,  consider  an 
arbitrary  vertex  v  of  /.  If  v  is  the  intersection  of  an  edge  e\  from  Si  and  an  edge 
ei  from  §2  then  we  can  decide  which  faces  of  Si  and  S2  contain  /  by  looking 
at  the  IncidentFace ()  pointer  of  the  appropriate  half-edges  corresponding  to  e\ 
and  <?2-  If  v  is  not  an  intersection  but  a  vertex  of,  say.  Si,  then  we  only  know 
the  face  of  Si  containing  /.  To  find  the  face  of  S2  containing  /,  we  have  to 
do  some  more  work:  we  have  to  determine  the  face  of  S2  that  contains  v.  In 
other  words,  if  we  knew  for  each  vertex  of  Si  in  which  face  of  S2  it  lay,  and 
vice  versa,  then  we  could  label  the  faces  of  0(81,82)  correctly.  How  can  we 
compute  this  information?  The  solution  is  to  apply  the  paradigm  that  has  been 
introduced  in  this  chapter,  plane  sweep,  once  more.  However,  we  won’t  explain 
this  final  step  here.  It  is  a  good  exercise  to  test  your  understanding  of  the  plane 
sweep  approach  to  design  the  algorithm  yourself.  (In  fact,  it  is  not  necessary  to 
compute  this  information  in  a  separate  plane  sweep.  It  can  also  be  done  in  the 
sweep  that  computes  the  intersections.) 


Putting  everything  together  we  get  the  following  algorithm. 

Algorithm  MapOverlay(Si,S2) 

Input.  Two  planar  subdivisions  Si  and  S2  stored  in  doubly-connected  edge  lists. 
Output.  The  overlay  of  Si  and  82  stored  in  a  doubly-connected  edge  list  T>. 

1.  Copy  the  doubly-connected  edge  lists  for  Si  and  S2  to  a  new  doubly- 
connected  edge  list  T>. 

2.  Compute  all  intersections  between  edges  from  Si  and  S2  with  the  plane 
sweep  algorithm  of  Section  2.1.  In  addition  to  the  actions  on  7  and  Q 
required  at  the  event  points,  do  the  following: 

■  Update  T>  as  explained  above  if  the  event  involves  edges  of  both  Si 
and  82-  (This  was  explained  for  the  case  where  an  edge  of  Si  passes 
through  a  vertex  of  S2.) 

■  Store  the  half-edge  immediately  to  the  left  of  the  event  point  at  the 
vertex  in  D  representing  it. 

3.  (*  Now  CD  is  the  doubly-connected  edge  list  for  0(Si , S2),  except  that  the 
information  about  the  faces  has  not  been  computed  yet.  *) 

4.  Determine  the  boundary  cycles  in  0 (S 1 , 82)  by  traversing  CD. 

5.  Construct  the  graph  9  whose  nodes  correspond  to  boundary  cycles  and 
whose  arcs  connect  each  hole  cycle  to  the  cycle  to  the  left  of  its  leftmost  ver¬ 
tex,  and  compute  its  connected  components.  (The  information  to  determine 
the  arcs  of  9  has  been  computed  in  line  2,  second  item.) 

6.  for  each  connected  component  in  9 

7.  do  Let  C  be  the  unique  outer  boundary  cycle  in  the  component  and  let 

/  denote  the  face  bounded  by  the  cycle.  Create  a  face  record  for  /, 
set  OuterComponent(f)  to  some  half-edge  of  6,  and  construct  the 
list  InnerComponents(f)  consisting  of  pointers  to  one  half-edge  in 
each  hole  cycle  in  the  component.  Let  the  IncidentFace  ()  pointers 
of  all  half-edges  in  the  cycles  point  to  the  face  record  of  /. 
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8.  Label  each  face  of  0(§i,§2)  with  the  names  of  the  faces  of  Si  and  S2 
containing  it,  as  explained  above. 

Theorem  2.6  Let  Si  be  a  planar  subdivision  of  complexity  n\,  let  S 2  be  a 
subdivision  of  complexity  112,  and  let  n  :=  n\  + 112-  The  overlay  of  Si  and  S 2 
can  be  constructed  in  0(nlogn  +  klogn)  time,  where  k  is  the  complexity  of  the 
overlay. 

Proof.  Copying  the  doubly-connected  edge  lists  in  line  1  takes  0(n)  time,  and 
the  plane  sweep  of  line  2  takes  0(n  log n  +  klogn)  time  by  Lemma  2.3.  Steps  4- 
7,  where  we  fill  in  the  face  records,  takes  time  linear  in  the  complexity  of 
0(Si ,  S2).  (The  connected  components  of  a  graph  can  be  determined  in  linear 
time  by  a  simple  depth  first  search.)  Finally,  labeling  each  face  in  the  resulting 
subdivision  with  the  faces  of  the  original  subdivisions  that  contain  it  can  be 
done  in  O(n\ogn  +  k\ogn)  time.  0 


2.4  Boolean  Operations 


The  map  overlay  algorithm  is  a  powerful  instrument  that  can  be  used  for  various 
other  applications.  One  particular  useful  one  is  performing  the  Boolean  opera¬ 
tions  union,  intersection,  and  difference  on  two  polygons  (Pi  and  3*2.  See  Figure 
2.7  for  an  example.  Note  that  the  output  of  the  operations  might  no  longer  be  a 
polygon.  It  can  consist  of  a  number  of  polygonal  regions,  some  with  holes. 


To  perform  the  Boolean  operation  we  regard  the  polygons  as  planar  maps 
whose  bounded  faces  are  labeled  (Pi  and  (P2,  respectively.  We  compute  the 
overlay  of  these  maps,  and  we  extract  the  faces  in  the  overlay  whose  labels 
correspond  to  the  particular  Boolean  operation  we  want  to  perform.  If  we  want 
to  compute  the  intersection  (Pi  D  (P2,  we  extract  the  faces  in  the  overlay  that  are 
labeled  with  (Pi  and  ON.  If  we  want  to  compute  the  union  03!  U  (P2,  we  extract  the 
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Figure  2. 7 

The  Boolean  operations  union, 
intersection  and  difference  on  two 
polygons  (Pi  and  ‘J’2 
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Chapter  2  faces  in  the  overlay  that  are  labeled  with  Ti  or  Ti.  And  if  we  want  to  compute 
line  segment  intersection  the  difference  Ti  \  T2,  we  extract  the  faces  in  the  overlay  that  are  labeled  with 

Ti  and  not  with  Ti. 

Because  every  intersection  point  of  an  edge  of  T |  and  an  edge  of  Ti  is 
a  vertex  of  T]  n  'JV  the  running  time  of  the  algorithm  is  Ofn  log/;  +  k  log/;), 
where  n  is  the  total  number  of  vertices  in  T;  and  T2,  and  k  is  the  complexity  of 
Ti  ft  T2.  The  same  holds  for  the  other  Boolean  operations:  every  intersection  of 
two  edges  is  a  vertex  of  the  final  result,  no  matter  which  operation  we  want  to 
perform.  We  immediately  get  the  following  result. 

Corollary  2.7  Let  T]  be  a  polygon  with  ti\  vertices  and  To  a  polygon  with  m 
vertices,  and  let  n  :=  ni  +  n,2 ■  Then  Ti  n  To,  Ti  U  To,  and  Ti  \  T2  can  each  be 
computed  in  0(nlogn  +  klogn)  time,  where  k  is  the  complexity  of  the  output. 


2.5  Notes  and  Comments 

The  line  segment  intersection  problem  is  one  of  the  most  fundamental  problems 
in  computational  geometry.  The  O(n\ogn  +  klogn)  solution  presented  in  this 
chapter  was  given  by  Bentley  and  Ottmann  [47]  in  1979.  (A  few  years  earlier, 
Shamos  and  Hoey  [351]  had  solved  the  detection  problem,  where  one  is  only 
interested  in  deciding  whether  there  is  at  least  one  intersection,  in  O(nlogn) 
time.)  The  method  for  reducing  the  working  storage  from  0(n  +  k)  to  O(n) 
described  in  this  chapter  is  taken  from  Pach  and  Sharir  [312],  who  also  show 
that  the  event  list  can  have  size  fl(n  logn)  before  this  improvement.  Brown  [77] 
describes  an  alternative  method  to  achieve  the  reduction. 

The  lower  bound  for  the  problem  of  reporting  all  line  segment  intersections 
is  Q.(nlogn  +k),  so  the  plane  sweep  algorithm  described  in  this  chapter  is 
not  optimal  when  k  is  large.  A  first  step  towards  an  optimal  algorithm  was 
taken  by  Chazelle  [88],  who  gave  an  algorithm  with  0(n  log2  n /  log  log/;  -\-k) 
running  time.  In  1988  Chazelle  and  Edelsbrunner  [99,  100]  presented  the  first 
0(n\ogn  +  k)  time  algorithm.  Unfortunately,  it  requires  0(n  +  k)  storage.  Later 
Clarkson  and  Shor  [133]  and  Mulmuley  [288]  gave  randomized  incremental 
algorithms  whose  expected  running  time  is  also  0(n  log  n  +  k).  (See  Chapter  4 
for  an  explanation  of  randomized  algorithms.)  The  working  storage  of  these 
algorithms  is  0(n)  and  0(n  +  k),  respectively.  Unlike  the  algorithm  of  Chazelle 
and  Edelsbrunner,  these  randomized  algorithms  also  work  for  computing  inter¬ 
sections  in  a  set  of  curves.  Balaban  [35]  gave  the  first  deterministic  algorithm 
for  the  segment  intersection  problem  that  works  in  0(n  log n  +  k)  time  and  0(n) 
space.  It  also  works  for  curves. 

There  are  cases  of  the  line  segment  intersection  problem  that  are  easier  than 
the  general  case.  One  such  case  is  where  we  have  two  sets  of  segments,  say 
red  segments  and  blue  segments,  such  that  no  two  segments  from  the  same 
set  intersect  each  other.  (This  is,  in  fact,  exactly  the  network  overlay  problem. 
In  the  solution  described  in  this  chapter,  however,  the  fact  that  the  segments 
came  from  two  sets  of  non-intersecting  segments  was  not  used.)  This  so-called 
red-blue  line  segment  intersection  problem  was  solved  in  0(n  logn  +  k)  time 
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and  0(n)  storage  by  Mairson  and  Stolfi  [262]  before  the  general  problem  was  Section  2.6 
solved  optimally.  Other  optimal  red-blue  intersection  algorithms  were  given  exercises 
by  Chazelle  et  al.  [101]  and  by  Palazzi  and  Snoeyink  [315].  If  the  two  sets  of 
segments  form  connected  subdivisions  then  the  situation  is  even  better:  in  this 
case  the  overlay  can  be  computed  in  0(n  +  k)  time,  as  has  been  shown  by  Finke 
and  Hinrichs  [176].  Their  result  generalizes  and  improves  previous  results  on 
map  overlay  by  Nievergelt  and  Preparata  [293],  Guibas  and  Seidel  [200],  and 
Mairson  and  Stolfi  [262], 

The  line  segment  intersection  counting  problem  is  to  determine  the  number 
of  intersection  points  in  a  set  of  n  line  segments.  Since  the  output  is  a  single 
integer,  a  term  with  k  in  the  time  bound  no  longer  refers  to  the  output  size 
(which  is  constant),  but  only  to  the  number  of  intersections.  Algorithms  that  do 
not  depend  on  the  number  of  intersections  take  0(n4/3logcn)  time,  for  some 
small  constant  c  [4,  95];  a  running  time  close  to  0{n logn)  is  not  known  to  exist. 


Plane  sweep  is  one  of  the  most  important  paradigms  for  designing  geometric 
algorithms.  The  first  algorithms  in  computational  geometry  based  on  this 
paradigm  are  by  Shamos  and  Hoey  [351],  Lee  and  Preparata  [250],  and  Bentley 
and  Ottmann  [47].  Plane  sweep  algorithms  are  especially  suited  for  finding 
intersections  in  sets  of  objects,  but  they  can  also  be  used  for  solving  many  other 
problems.  In  Chapter  3  plane  sweep  solves  part  of  the  polygon  triangulation 
problem,  and  in  Chapter  7  we  will  see  a  plane  sweep  algorithm  to  compute  the 
so-called  Voronoi  diagram  of  a  set  of  points.  The  algorithm  presented  in  the 
current  chapter  sweeps  a  horizontal  line  downwards  over  the  plane.  For  some 
problems  it  is  more  convenient  to  sweep  the  plane  in  another  way.  For  instance, 
we  can  sweep  the  plane  with  a  rotating  line — see  Chapter  15  for  an  example — or 
with  a  pseudo-line  (a  line  that  need  not  be  straight,  but  otherwise  behaves  more 
or  less  as  a  line)  [159].  The  plane  sweep  technique  can  also  be  used  in  higher 
dimensions:  here  we  sweep  the  space  with  a  hyperplane  [213,  311,  324].  Such 
algorithms  are  called  space  sweep  algorithms. 


In  this  chapter  we  described  a  data  structure  for  storing  subdivisions:  the  doubly- 
connected  edge  list.  This  structure,  or  in  fact  a  variant  of  it,  was  described  by 
Muller  and  Preparata  [286].  There  are  also  other  data  structures  for  storing 
subdivisions,  such  as  the  winged  edge  structure  by  Baumgart  [40]  and  the  quad 
edge  structure  by  Guibas  and  Stolfi  [202].  The  difference  between  all  these 
structures  is  small.  They  all  have  more  or  less  the  same  functionality,  but  some 
save  a  few  bytes  of  storage  per  edge. 


2.6  Exercises 

2.1  Let  S  be  a  set  of  n  disjoint  line  segments  whose  upper  endpoints  lie  on  the 
line  y  =  1  and  whose  lower  endpoints  lie  on  the  line  y  =  0.  These  segments 
partition  the  horizontal  strip  [— °°  :  °°]  x  [0  :  1]  into  n+  1  regions.  Give  an 
O(n\ogn)  time  algorithm  to  build  a  binary  search  tree  on  the  segments 
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Chapter  2  in  S  such  that  the  region  containing  a  query  point  can  be  determined  in 

line  segment  intersection  0(log«)  time.  Also  describe  the  query  algorithm  in  detail. 

2.2  The  intersection  detection  problem  for  a  set  S  of  n  line  segments  is  to 
determine  whether  there  exists  a  pair  of  segments  in  S  that  intersect.  Give 
a  plane  sweep  algorithm  that  solves  the  intersection  detection  problem  in 
0(nlogn)  time. 


2.3  Change  the  code  of  Algorithm  FlNDlNTERSECTlONS  (and  of  the  pro¬ 
cedures  that  it  calls)  such  that  the  working  storage  is  0(n)  instead  of 
0{n  +  k). 


2.4  Let  S  be  a  set  of  n  line  segments  in  the  plane  that  may  (partly)  overlap 
each  other.  For  example,  S  could  contain  the  segments  (0,0)  (1,0)  and 
( —  1 , 0)  (2, 0) .  We  want  to  compute  all  intersections  in  S.  More  precisely, 
we  want  to  compute  each  proper  intersection  of  two  segments  in  5  (that 
is,  each  intersection  of  two  non-parallel  segments)  and  for  each  end¬ 
point  of  a  segment  all  segments  containing  the  point.  Adapt  algorithm 
FlNDlNTERSECTlONS  to  this  end. 


2.5  Which  of  the  following  equalities  are  always  true? 


Twin(Twin(e)) 
Next(Prev(e)) 
Twin  ( Prev(  Twin  (e) ) ) 
IncidentFace  ( e ) 


e 

e 

Next(e ) 

IncidentFace  ( Next{e ) ) 


2.6  Give  an  example  of  a  doubly-connected  edge  list  where  for  an  edge  e  the 
faces  IncidentFace{e)  and  IncidentFace(Twin(e ))  are  the  same. 

2.7  Given  a  doubly-connected  edge  list  representation  of  a  subdivision  where 
Twin(e)  =  Next(e )  holds  for  every  half-edge  e,  how  many  faces  can  the 
subdivision  have  at  most? 

2.8  Give  pseudocode  for  an  algorithm  that  lists  all  vertices  adjacent  to  a 
given  vertex  v  in  a  doubly-connected  edge  list.  Also,  give  pseudocode 
for  an  algorithm  that  lists  all  edges  that  bound  a  face  in  a  not  necessarily 
connected  subdivision. 

2.9  Suppose  that  a  doubly-connected  edge  list  of  a  connected  subdivision  is 
given.  Give  pseudocode  for  an  algorithm  that  lists  all  faces  with  vertices 
that  appear  on  the  outer  boundary. 

2.10  Let  S  be  a  subdivision  of  complexity  n,  and  let  P  be  a  set  of  m  points.  Give 
a  plane  sweep  algorithm  that  computes  for  every  point  in  P  in  which  face 
of  S  it  is  contained.  Show  that  your  algorithm  runs  in  0((n  +  m)  log (n  + 
m))  time. 

2.1 1  Let  S  be  a  set  of  n  circles  in  the  plane.  Describe  a  plane  sweep  algorithm 
to  compute  all  intersection  points  between  the  circles.  (Because  we  deal 
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with  circles,  not  discs,  two  circles  do  not  intersect  if  one  lies  entirely  Section  2.6 
inside  the  other.)  Your  algorithm  should  run  in  0((n  +  k)  logn)  time,  exercises 
where  k  is  the  number  of  intersection  points. 

2.12  Let  S  be  a  set  of  n  triangles  in  the  plane.  The  boundaries  of  the  triangles 
are  disjoint,  but  it  is  possible  that  a  triangle  lies  completely  inside  another 
triangle.  Let  P  be  a  set  of  n  points  in  the  plane.  Give  an  O(nlogn) 
algorithm  that  reports  each  point  in  P  lying  outside  all  triangles. 


2.13*  Let  S  be  a  set  of  n  disjoint  triangles  in  the  plane.  We  want  to  find  a  set  of 
n  —  1  segments  with  the  following  properties: 

■  Each  segment  connects  a  point  on  the  boundary  of  one  triangle  to  a 
point  on  the  boundary  of  another  triangle. 

■  The  interiors  of  the  segments  are  pairwise  disjoint  and  they  are  disjoint 
from  the  triangles. 

■  Together  they  connect  all  triangles  to  each  other,  that  is,  by  walking 
along  the  segments  and  the  triangle  boundaries  it  must  be  possible  to 
walk  from  a  triangle  to  any  other  triangle. 

Develop  a  plane  sweep  algorithm  for  this  problem  that  runs  in  O(nlogn) 
time.  State  the  events  and  the  data  structures  that  you  use  explicitly,  and 
describe  the  cases  that  arise  and  the  actions  required  for  each  of  them. 
Also  state  the  sweep  invariant. 

2.14  Let  5  be  a  set  of  n  disjoint  line  segments  in  the  plane,  and  let  p  be  a 
point  not  on  any  of  the  line  segments  of  5.  We  wish  to  determine  all 
line  segments  of  S  that  p  can  see,  that  is,  all  line  segments  of  S  that 
contain  some  point  q  so  that  the  open  segment  J>q  doesn’t  intersect  any 
line  segment  of  5.  Give  an  0{n  logn)  time  algorithm  for  this  problem  that 
uses  a  rotating  half-line  with  its  endpoint  at  p. 


not  visible 


3  Polygon  Triangulation 

Guarding  an  Art  Gallery 


Works  of  famous  painters  are  not  only  popular  among  art  lovers,  but  also  among 
criminals.  They  are  very  valuable,  easy  to  transport,  and  apparently  not  so 
difficult  to  sell.  Art  galleries  therefore  have  to  guard  their  collections  carefully. 


Figure  3. 1 
An  art  gallery 


During  the  day  the  attendants  can  keep  a  look-out,  but  at  night  this  has  to  be 
done  by  video  cameras.  These  cameras  are  usually  hung  from  the  ceiling  and 
they  rotate  about  a  vertical  axis.  The  images  from  the  cameras  are  sent  to  TV 
screens  in  the  office  of  the  night  watch.  Because  it  is  easier  to  keep  an  eye  on 
few  TV  screens  rather  than  on  many,  the  number  of  cameras  should  be  as  small 
as  possible.  An  additional  advantage  of  a  small  number  of  cameras  is  that  the 
cost  of  the  security  system  will  be  lower.  On  the  other  hand  we  cannot  have 
too  few  cameras,  because  every  part  of  the  gallery  must  be  visible  to  at  least 
one  of  them.  So  we  should  place  the  cameras  at  strategic  positions,  such  that 
each  of  them  guards  a  large  part  of  the  gallery.  This  gives  rise  to  what  is  usually 
referred  to  as  the  Art  Gallery  Problem :  how  many  cameras  do  we  need  to  guard 
a  given  gallery  and  how  do  we  decide  where  to  place  them? 


Chapter 3  3.1  Guarding  and  Triangulations 

POLYGON  TRIANGULATION 

If  we  want  to  define  the  art  gallery  problem  more  precisely,  we  should  first 
formalize  the  notion  of  gallery.  A  gallery  is,  of  course,  a  3-dimensional  space, 
but  a  floor  plan  gives  us  enough  information  to  place  the  cameras.  Therefore  we 
model  a  gallery  as  a  polygonal  region  in  the  plane.  We  further  restrict  ourselves 
to  regions  that  are  simple  polygons,  that  is,  regions  enclosed  by  a  single  closed 
polygonal  chain  that  does  not  intersect  itself.  Thus  we  do  not  allow  regions  with 
holes.  A  camera  position  in  the  gallery  corresponds  to  a  point  in  the  polygon.  A 
camera  sees  those  points  in  the  polygon  to  which  it  can  be  connected  with  an 
open  segment  that  lies  in  the  interior  of  the  polygon. 

How  many  cameras  do  we  need  to  guard  a  simple  polygon?  This  clearly 
depends  on  the  polygon  at  hand:  the  more  complex  the  polygon,  the  more 
cameras  are  required.  We  shall  therefore  express  the  bound  on  the  number  of 
cameras  needed  in  terms  of  n,  the  number  of  vertices  of  the  polygon.  But  even 
when  two  polygons  have  the  same  number  of  vertices,  one  can  be  easier  to  guard 
than  the  other.  A  convex  polygon,  for  example,  can  always  be  guarded  with  one 
camera.  To  be  on  the  safe  side  we  shall  look  at  the  worst-case  scenario,  that  is, 
we  shall  give  a  bound  that  is  good  for  any  simple  polygon  with  n  vertices.  (It 
would  be  nice  if  we  could  find  the  minimum  number  of  cameras  for  the  specific 
polygon  we  are  given,  not  just  a  worst-case  bound.  Unfortunately,  the  problem 
of  finding  the  minimum  number  of  cameras  for  a  given  polygon  is  NP-hard.) 

Let  CP  be  a  simple  polygon  with  n  vertices.  Because  CP  may  be  a  complicated 
shape,  it  seems  difficult  to  say  anything  about  the  number  of  cameras  we  need 
to  guard  CP.  Hence,  we  first  decompose  CP  into  pieces  that  are  easy  to  guard, 
namely  triangles.  We  do  this  by  drawing  diagonals  between  pairs  of  vertices. 


Figure  3.2 

A  simple  polygon  and  a  possible 
triangulation  of  it 


A  diagonal  is  an  open  line  segment  that  connects  two  vertices  of  CP  and  lies  in 
the  interior  of  CP.  A  decomposition  of  a  polygon  into  triangles  by  a  maximal 
set  of  non-intersecting  diagonals  is  called  a  triangulation  of  the  polygon — see 
Figure  3.2.  (We  require  that  the  set  of  non-intersecting  diagonals  be  maximal  to 
ensure  that  no  triangle  has  a  polygon  vertex  in  the  interior  of  one  of  its  edges. 
This  could  happen  if  the  polygon  has  three  consecutive  collinear  vertices.) 
Triangulations  are  usually  not  unique;  the  polygon  in  Figure  3.2,  for  example, 
can  be  triangulated  in  many  different  ways.  We  can  guard  CP  by  placing  a  camera 
in  every  triangle  of  a  triangulation  CFp  of  CP.  But  does  a  triangulation  always 
exist?  And  how  many  triangles  can  there  be  in  a  triangulation?  The  following 
theorem  answers  these  questions. 
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Theorem  3.1  Every  simple  polygon  admits  a  triangulation,  and  any  triangula-  Section  3.1 

tion  of  a  simple  polygon  with  n  vertices  consists  of  exactly  n  —  2  triangles.  guarding  and  triangulations 


Proof.  We  prove  this  theorem  by  induction  on  n.  When  n  =  3  the  polygon  itself 
is  a  triangle  and  the  theorem  is  trivially  true.  Let  n  >  3,  and  assume  that  the 
theorem  is  true  for  all  m  <  n.  Let  7  be  a  polygon  with  n  vertices.  We  first  prove 
the  existence  of  a  diagonal  in  7.  Let  v  be  the  leftmost  vertex  of  7.  (In  case  of 
ties,  we  take  the  lowest  leftmost  vertex.)  Let  u  and  w  be  the  two  neighboring 
vertices  of  v  on  the  boundary  of  7.  If  the  open  segment  uw  lies  in  the  interior  of 
7,  we  have  found  a  diagonal.  Otherwise,  there  are  one  or  more  vertices  inside 
the  triangle  defined  by  u,  v,  and  w,  or  on  the  diagonal  uw.  Of  those  vertices,  let 
v'  be  the  one  farthest  from  the  line  through  u  and  w.  The  segment  connecting  v' 
to  v  cannot  intersect  an  edge  of  7,  because  such  an  edge  would  have  an  endpoint 
inside  the  triangle  that  is  farther  from  the  line  through  u  and  vv,  contradicting 
the  definition  of  v'.  Hence,  vv'  is  a  diagonal. 

So  a  diagonal  exists.  Any  diagonal  cuts  7  into  two  simple  subpolygons  7\ 
and  72-  Let  m\  be  the  number  of  vertices  of  7\  and  m2  the  number  of  vertices 
of  72-  Both  mi  and  m2  must  be  smaller  than  n,  so  by  induction  7\  and  7 2  can 
be  triangulated.  Hence,  7  can  be  triangulated  as  well. 

It  remains  to  prove  that  any  triangulation  of  7  consists  of  n  —  2  triangles.  To 
this  end,  consider  an  arbitrary  diagonal  in  some  triangulation  Ty.  This  diagonal 
cuts  7  into  two  subpolygons  with  mi  and  m2  vertices,  respectively.  Every 
vertex  of  7  occurs  in  exactly  one  of  the  two  subpolygons,  except  for  the  vertices 
defining  the  diagonal,  which  occur  in  both  subpolygons.  Hence,  m\  +m 2  =n  +  2. 
By  induction,  any  triangulation  of  7 ,•  consists  of  m,  —  2  triangles,  which  implies 
that  Ty  consists  of  (mi  —  2)  +  (m2  —  2)  =  n  —  2  triangles.  0 


Theorem  3. 1  implies  that  any  simple  polygon  with  n  vertices  can  be  guarded 
with  n—2  cameras.  But  placing  a  camera  inside  every  triangle  seems  overkill. 
A  camera  placed  on  a  diagonal,  for  example,  will  guard  two  triangles,  so  by 
placing  the  cameras  on  well-chosen  diagonals  we  might  be  able  to  reduce  the 
number  of  cameras  to  roughly  n/2.  Placing  cameras  at  vertices  seems  even 
better,  because  a  vertex  can  be  incident  to  many  triangles,  and  a  camera  at  that 
vertex  guards  all  of  them.  This  suggests  the  following  approach. 

Let  ‘.Lp  be  a  triangulation  of  7.  Select  a  subset  of  the  vertices  of  7,  such 
that  any  triangle  in  Ty  has  at  least  one  selected  vertex,  and  place  the  cameras  at 
the  selected  vertices.  To  find  such  a  subset  we  assign  each  vertex  of  7  a  color: 
white,  gray,  or  black.  The  coloring  will  be  such  that  any  two  vertices  connected 
by  an  edge  or  a  diagonal  have  different  colors.  This  is  called  a  3-coloring  of  a 
triangulated  polygon.  In  a  3-coloring  of  a  triangulated  polygon,  every  triangle 
has  a  white,  a  gray,  and  a  black  vertex.  Hence,  if  we  place  cameras  at  all  gray 
vertices,  say,  we  have  guarded  the  whole  polygon.  By  choosing  the  smallest 
color  class  to  place  the  cameras,  we  can  guard  7  using  at  most  [n/3j  cameras. 

But  does  a  3-coloring  always  exist?  The  answer  is  yes.  To  see  this,  we  look 
at  what  is  called  the  dual  graph  of  Ty.  This  graph  S(Ty)  has  a  node  for  every 
triangle  in  Ty.  We  denote  the  triangle  corresponding  to  a  node  v  by  t(v).  There 
is  an  arc  between  two  nodes  v  and  p  if  t(v)  and  f(ju)  share  a  diagonal.  The  arcs 
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Chapter  3 

POLYGON  TRIANGULATION 


in  S(T;p)  correspond  to  diagonals  in  Ty.  Because  any  diagonal  cuts  IP  into  two, 
the  removal  of  an  edge  from  S(Ty)  splits  the  graph  into  two.  Hence,  SfXp) 
is  a  tree.  (Notice  that  this  is  not  true  for  a  polygon  with  holes.)  This  means 
that  we  can  find  a  3-coloring  using  a  simple  graph  traversal,  such  as  depth  first 
search.  Next  we  describe  how  to  do  this.  While  we  do  the  depth  first  search, 
we  maintain  the  following  invariant:  all  vertices  of  the  already  encountered 
triangles  have  been  colored  white,  gray,  or  black,  and  no  two  connected  vertices 
have  received  the  same  color.  The  invariant  implies  that  we  have  computed  a 
valid  3-coloring  when  all  triangles  have  been  encountered.  The  depth  first  search 
can  be  started  from  any  node  of  S(7y);  the  three  vertices  of  the  corresponding 
triangle  are  colored  white,  gray,  and  black.  Now  suppose  that  we  reach  a  node 
V  in  S,  coming  from  node  p.  Hence,  1(  V)  and  t(p)  share  a  diagonal.  Since  the 
vertices  of  t(p)  have  already  been  colored,  only  one  vertex  of  t(v)  remains  to 
be  colored.  There  is  one  color  left  for  this  vertex,  namely  the  color  that  is  not 
used  for  the  vertices  of  the  diagonal  between  f(v)  and  t(p).  Because  S(Ty)  is 
a  tree,  the  other  nodes  adjacent  to  v  have  not  been  visited  yet,  and  we  still  have 
the  freedom  to  give  the  vertex  the  remaining  color. 


[n/3j  prongs 


We  conclude  that  a  triangulated  simple  polygon  can  always  be  3-colored.  As  a 
result,  any  simple  polygon  can  be  guarded  with  \  n/ 3J  cameras.  But  perhaps  we 
can  do  even  better.  After  all,  a  camera  placed  at  a  vertex  may  guard  more  than 
just  the  incident  triangles.  Unfortunately,  for  any  n  there  are  simple  polygons 
that  require  [n/3\  cameras.  An  example  is  a  comb-shaped  polygon  with  a  long 
horizontal  base  edge  and  |  n /3J  prongs  made  of  two  edges  each.  The  prongs  are 
connected  by  horizontal  edges.  The  construction  can  be  made  such  that  there  is 
no  position  in  the  polygon  from  which  a  camera  can  look  into  two  prongs  of  the 
comb  simultaneously.  So  we  cannot  hope  for  a  strategy  that  always  produces 
less  than  \  nj 3J  cameras.  In  other  words,  the  3-coloring  approach  is  optimal  in 
the  worst  case. 

We  just  proved  the  Art  Gallery  Theorem,  a  classical  result  from  combinato¬ 
rial  geometry. 


Theorem  3.2  (Art  Gallery  Theorem)  For  a  simple  polygon  with  n  vertices, 
|_n/3j  cameras  are  occasionally  necessary  and  always  sufficient  to  have  every 
point  in  the  polygon  visible  from  at  least  one  of  the  cameras. 

Now  we  know  that  \  n/ 3J  cameras  are  always  sufficient.  But  we  don’t  have 
an  efficient  algorithm  to  compute  the  camera  positions  yet.  What  we  need  is  a 
fast  algorithm  for  triangulating  a  simple  polygon.  The  algorithm  should  deliver 
a  suitable  representation  of  the  triangulation — a  doubly-connected  edge  list,  for 
instance — so  that  we  can  step  in  constant  time  from  a  triangle  to  its  neighbors. 
Given  such  a  representation,  we  can  compute  a  set  of  at  most  [w/3j  camera 
positions  in  linear  time  with  the  method  described  above:  use  depth  first  search 
on  the  dual  graph  to  compute  a  3-coloring  and  take  the  smallest  color  class 
to  place  the  cameras.  In  the  coming  sections  we  describe  how  to  compute  a 

_  triangulation  in  O(nlogn)  time.  Anticipating  this,  we  already  state  the  final 

48  result  about  guarding  a  polygon. 


Theorem  3.3  Let  CP  be  a  simple  polygon  with  n  vertices.  A  set  of  [n /3J  camera  Section  3.2 

positions  in  CP  such  that  any  point  inside  CP  is  visible  from  at  least  one  of  the  partitioning  a  polygon  into 

cameras  can  be  computed  in  O(nlogn)  time.  monotone  pieces 


3.2  Partitioning  a  Polygon  into  Monotone  Pieces 

Let  CP  be  a  simple  polygon  with  n  vertices.  We  saw  in  Theorem  3.1  that  a 
triangulation  of  CP  always  exists.  The  proof  of  that  theorem  is  constructive  and 
leads  to  a  recursive  triangulation  algorithm:  find  a  diagonal  and  triangulate 
the  two  resulting  subpolygons  recursively.  To  find  the  diagonal  we  take  the 
leftmost  vertex  of  CP  and  try  to  connect  its  two  neighbors  u  and  vv;  if  this  fails 
we  connect  v  to  the  vertex  farthest  from  uw  inside  the  triangle  defined  by  u, 
v,  and  w.  This  way  it  takes  linear  time  to  find  a  diagonal.  This  diagonal 
may  split  CP  into  a  triangle  and  a  polygon  with  n—  1  vertices.  Indeed,  if  we 
succeed  to  connect  u  and  vv  this  will  always  be  the  case.  As  a  consequence, 
the  triangulation  algorithm  will  take  quadratic  time  in  the  worst  case.  Can  we 
do  better?  For  some  classes  of  polygons  we  surely  can.  Convex  polygons,  for 
instance,  are  easy:  Pick  one  vertex  of  the  polygon  and  draw  diagonals  from 
this  vertex  to  all  other  vertices  except  its  neighbors.  This  takes  only  linear  time. 
So  a  possible  approach  to  triangulate  a  non-convex  polygon  would  be  to  first 
decompose  CP  into  convex  pieces,  and  then  triangulate  the  pieces.  Unfortunately, 
it  is  as  difficult  to  partition  a  polygon  into  convex  pieces  as  it  is  to  triangulate  it. 
Therefore  we  shall  decompose  CP  into  so-called  monotone  pieces,  which  turns 
out  to  be  a  lot  easier. 

A  simple  polygon  is  called  monotone  with  respect  to  a  line  t  if  for  any  line 
d  perpendicular  to  l  the  intersection  of  the  polygon  with  t'  is  connected.  In 
other  words,  the  intersection  should  be  a  line  segment,  a  point,  or  empty.  A 
polygon  that  is  monotone  with  respect  to  the  y-axis  is  called  y-monotone .  The 
following  property  is  characteristic  for  y-monotone  polygons:  if  we  walk  from 
a  topmost  to  a  bottommost  vertex  along  the  left  (or  the  right)  boundary  chain, 
then  we  always  move  downwards  or  horizontally,  never  upwards. 

Our  strategy  to  triangulate  the  polygon  CP  is  to  first  partition  CP  into  y-monotone 
pieces,  and  then  triangulate  the  pieces.  We  can  partition  a  polygon  into  mono¬ 
tone  pieces  as  follows.  Imagine  walking  from  the  topmost  vertex  of  CP  to  the 
bottommost  vertex  on  the  left  or  right  boundary  chain.  A  vertex  where  the 
direction  in  which  we  walk  switches  from  downward  to  upward  or  from  upward 
to  downward  is  called  a  turn  vertex.  To  partition  CP  into  y-monotone  pieces  we 
should  get  rid  of  these  turn  vertices.  This  can  be  done  by  adding  diagonals.  If 
at  a  turn  vertex  v  both  incident  edges  go  down  and  the  interior  of  the  polygon 
locally  lies  above  v,  then  we  must  choose  a  diagonal  that  goes  up  from  v.  The 
diagonal  splits  the  polygon  into  two.  The  vertex  v  will  appear  in  both  pieces. 
Moreover,  in  both  pieces  v  has  an  edge  going  down  (namely  on  original  edge 
of  CP)  and  an  edge  going  up  (the  diagonal).  Hence,  v  cannot  be  a  turn  vertex 
anymore  in  either  of  them.  If  both  incident  edges  of  a  turn  vertex  go  up  and 
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Figure  3.3 
Five  types  of  vertices 


the  interior  locally  lies  below  it,  we  have  to  choose  a  diagonal  that  goes  down. 
Apparently  there  are  different  types  of  turn  vertices.  Let’s  make  this  more 
precise. 

If  we  want  to  define  the  different  types  of  turn  vertices  carefully,  we  should 
pay  special  attention  to  vertices  with  equal  y-coordinate.  We  do  this  by  defining 
the  notions  of  “below”  and  “above”  as  follows:  a  point  p  is  below  another 
point  q  if  py  <  qy  or  py  =  qy  and  px  >  qx,  and  p  is  above  q  if  py  >  qy  or 
py  =  qy  and  px  <  qx.  (You  can  imagine  rotating  the  plane  slightly  in  clockwise 
direction  with  respect  to  the  coordinate  system,  such  that  no  two  points  have 
the  same  y-coordinate;  the  above/below  relation  we  just  defined  is  the  same  as 
the  above/below  relation  in  this  slightly  rotated  plane.) 


□  =  start  vertex 
■  =  end  vertex 
•  =  regular  vertex 
a  =  split  vertex 
▼  =  merge  vertex 


We  distinguish  five  types  of  vertices  in  CP — see  Figure  3.3.  Four  of  these 
types  are  turn  vertices:  start  vertices,  split  vertices,  end  vertices,  and  merge 
vertices.  They  are  defined  as  follows.  A  vertex  v  is  a  start  vertex  if  its  two 
neighbors  lie  below  it  and  the  interior  angle  at  v  is  less  than  n\  if  the  interior 
angle  is  greater  than  n  then  v  is  a  split  vertex.  (If  both  neighbors  lie  below 
v,  then  the  interior  angle  cannot  be  exactly  n.)  A  vertex  is  an  end  vertex  if 
its  two  neighbors  lie  above  it  and  the  interior  angle  at  v  is  less  than  n\  if  the 
interior  angle  is  greater  than  7Z  then  v  is  a  merge  vertex.  The  vertices  that 
are  not  turn  vertices  are  regular  vertices.  Thus  a  regular  vertex  has  one  of  its 
neighbors  above  it,  and  the  other  neighbor  below  it.  These  names  have  been 
chosen  because  the  algorithm  will  use  a  downward  plane  sweep,  maintaining  the 
intersection  of  the  sweep  line  with  the  polygon.  When  the  sweep  line  reaches 
a  split  vertex,  a  component  of  the  intersection  splits,  when  it  reaches  a  merge 
vertex,  two  components  merge,  and  so  on. 

The  split  and  merge  vertices  are  sources  of  local  non-monotonicity.  The 
following,  stronger  statement  is  even  true. 

Lemma  3.4  A  polygon  is  y-monotone  if  it  has  no  split  vertices  or  merge  vertices. 

Proof.  Suppose  CP  is  not  y-monotone.  We  have  to  prove  that  CP  contains  a  split 
or  a  merge  vertex. 
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Since  CP  is  not  monotone,  there  is  a  horizontal  line  f  that  intersects  CP  in 
more  than  one  connected  component.  We  can  choose  i  such  that  the  leftmost 
component  is  a  segment,  not  a  single  point.  Let  p  be  the  left  endpoint  of  this 
segment,  and  let  q  be  the  right  endpoint.  Starting  at  q ,  we  follow  the  boundary 
of  CP  such  that  CP  lies  to  the  left  of  the  boundary.  (This  means  that  we  go  up  from 
q.)  At  some  point,  let’s  call  it  r,  the  boundary  will  intersect  t  again.  If  r/  p,  as 
in  Figure  3.4(a),  then  the  highest  vertex  we  encountered  while  going  from  q  to  r 
must  be  a  split  vertex,  and  we  are  done. 


If  r  =  p,  as  in  Figure  3.4(b),  we  again  follow  the  boundary  of  CP  starting 
at  q ,  but  this  time  in  the  other  direction.  As  before,  the  boundary  will  intersect  L 
Let  r'  be  the  point  where  this  happens.  We  cannot  have  r  =  p,  because  that 
would  mean  that  the  boundary  of  CP  intersects  t  only  twice,  contradicting  that  t 
intersects  CP  in  more  than  one  component.  So  we  have  r'  p ,  implying  that  the 
lowest  vertex  we  have  encountered  while  going  from  q  to  r'  must  be  a  merge 
vertex.  0 

Lemma  3.4  implies  that  CP  has  been  partitioned  into  y-monotone  pieces  once 
we  get  rid  of  its  split  and  merge  vertices.  We  do  this  by  adding  a  diagonal  going 
upward  from  each  split  vertex  and  a  diagonal  going  downward  from  each  merge 
vertex.  These  diagonals  should  not  intersect  each  other,  of  course.  Once  we 
have  done  this,  CP  has  been  partitioned  into  y-monotonc  pieces. 


Let’s  first  see  how  we  can  add  the  diagonals  for  the  split  vertices.  We  use  a  plane 
sweep  method  for  this.  Let  vi,  V2, . . . ,  vn  be  a  counterclockwise  enumeration 
of  the  vertices  of  CP.  Let  e\ , . . .  ,e„  be  the  set  of  edges  of  CP,  where  e,  =  v,-v/+i 
for  1  ^  i  <n  and  <?„  =  vyLf  ■  The  plane  sweep  algorithm  moves  an  imaginary 
sweep  line  £  downward  over  the  plane.  The  sweep  line  halts  at  certain  event 
points.  In  our  case  these  will  be  the  vertices  of  CP;  no  new  event  points  will  be 
created  during  the  sweep.  The  event  points  are  stored  in  a  event  queue  Q.  The 
event  queue  is  a  priority  queue,  where  the  priority  of  a  vertex  is  its  y-coordinate. 
If  two  vertices  have  the  same  y-coordinate  then  the  leftmost  one  has  higher 
priority.  This  way  the  next  event  to  be  handled  can  be  found  in  O(logn)  time. 
(Because  no  new  events  are  generated  during  the  sweep,  we  could  also  sort  the 
vertices  on  y-coordinate  before  the  sweep,  and  then  use  the  sorted  list  to  find 
the  next  event  in  0(1)  time.) 
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Figure  3.4 

Two  cases  in  the  proof  of  Lemma  3.4 
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diagonal  will  be  added 
when  the  sweep  line 
reaches  vm 


The  goal  of  the  sweep  is  to  add  diagonals  from  each  split  vertex  to  a  vertex 
lying  above  it.  Suppose  that  the  sweep  line  reaches  a  split  vertex  v,.  To  which 
vertex  should  we  connect  v,?  A  good  candidate  is  a  vertex  close  to  v,,  because 
we  can  probably  connect  v,  to  this  vertex  without  intersecting  any  edge  of  IP. 
Let’s  make  this  more  precise.  Let  ej  be  the  edge  immediately  to  the  left  of  v, 
on  the  sweep  line,  and  let  be  the  edge  immediately  to  the  right  of  v,  on  the 
sweep  line.  Then  we  can  always  connect  v,-  to  the  lowest  vertex  in  between 
ej  and  e^.  and  above  v,.  If  there  is  no  such  vertex  then  we  can  connect  v,  to 
the  upper  endpoint  of  e;-  or  to  the  upper  endpoint  of  tj..  We  call  this  vertex  the 
helper  of  and  denote  it  by  helper(ej).  Formally,  helper(ej)  is  defined  as  the 
lowest  vertex  above  the  sweep  line  such  that  the  horizontal  segment  connecting 
the  vertex  to  ej  lies  inside  7.  Note  that  helper(ej)  can  be  the  upper  endpoint  of 
ej  itself. 

Now  we  know  how  to  get  rid  of  split  vertices:  connect  them  to  the  helper  of  the 
edge  to  their  left.  What  about  merge  vertices?  They  seem  more  difficult  to  get 
rid  of,  because  they  need  a  diagonal  to  a  vertex  that  is  lower  than  they  are.  Since 
the  part  of  CP  below  the  sweep  line  has  not  been  explored  yet,  we  cannot  add 
such  a  diagonal  when  we  encounter  a  merge  vertex.  Fortunately,  this  problem 
is  easier  than  it  seems  at  first  sight.  Suppose  the  sweep  line  reaches  a  merge 
vertex  v,-.  Let  ej  and  ty.  be  the  edges  immediately  to  the  right  and  to  the  left  of 
Vi  on  the  sweep  line,  respectively.  Observe  that  v,-  becomes  the  new  helper  of  ej 
when  we  reach  it.  We  would  like  to  connect  v,  to  the  highest  vertex  below  the 
sweep  line  in  between  ej  and  e^.  This  is  exactly  the  opposite  of  what  we  did 
for  split  vertices,  which  we  connected  to  the  lowest  vertex  above  the  sweep  line 
in  between  e,-  and  e/,.  This  is  not  surprising:  merge  vertices  are  split  vertices 
upside  down.  Of  course  we  don’t  know  the  highest  vertex  below  the  sweep  line 
when  we  reach  v,-.  But  it  is  easy  to  find  later  on:  when  we  reach  a  vertex  vm 
that  replaces  v,  as  the  helper  of  ej,  then  this  is  the  vertex  we  are  looking  for. 
So  whenever  we  replace  the  helper  of  some  edge,  we  check  whether  the  old 
helper  is  a  merge  vertex  and,  if  so,  we  add  the  diagonal  between  the  old  helper 
and  the  new  one.  This  diagonal  is  always  added  when  the  new  helper  is  a  split 
vertex,  to  get  rid  of  the  split  vertex.  If  the  old  helper  was  a  merge  vertex,  we 
thus  get  rid  of  a  split  vertex  and  a  merge  vertex  with  the  same  diagonal.  It  can 
also  happen  that  the  helper  of  ej  is  not  replaced  anymore  below  v,.  In  this  case 
we  can  connect  v,  to  the  lower  endpoint  of  er 

In  the  approach  above,  we  need  to  find  the  edge  to  the  left  of  each  vertex. 
Therefore  we  store  the  edges  of  7  intersecting  the  sweep  line  in  the  leaves 
of  a  dynamic  binary  search  tree  T.  The  left-to-right  order  of  the  leaves  of 
T  corresponds  to  the  left-to-right  order  of  the  edges.  Because  we  are  only 
interested  in  edges  to  the  left  of  split  and  merge  vertices  we  only  need  to  store 
edges  in  T  that  have  the  interior  of  7  to  their  right.  With  each  edge  in  7  we  store 
its  helper.  The  tree  T  and  the  helpers  stored  with  the  edges  form  the  status  of  the 
sweep  line  algorithm.  The  status  changes  as  the  sweep  line  moves:  edges  start 
or  stop  intersecting  the  sweep  line,  and  the  helper  of  an  edge  may  be  replaced. 

The  algorithm  partitions  7  into  subpolygons  that  have  to  be  processed 


further  in  a  later  stage.  To  have  easy  access  to  these  subpolygons  we  shall  store 
the  subdivision  induced  by  T  and  the  added  diagonals  in  a  doubly-connected 
edge  list  CD.  We  assume  that  CP  is  initially  specified  as  a  doubly-connected  edge 
list;  if  CP  is  given  in  another  form — by  a  counterclockwise  list  of  its  vertices,  for 
example — we  first  construct  a  doubly-connected  edge  list  for  CP.  The  diagonals 
computed  for  the  split  and  merge  vertices  are  added  to  the  doubly-connected 
edge  list.  To  access  the  doubly-connected  edge  list  we  use  cross-pointers 
between  the  edges  in  the  status  structure  and  the  corresponding  edges  in  the 
doubly-connected  edge  list.  Adding  a  diagonal  can  then  be  done  in  constant 
time  with  some  simple  pointer  manipulations.  The  global  algorithm  is  now  as 
follows. 

Algorithm  MakeMonotone(CP) 

Input.  A  simple  polygon  CP  stored  in  a  doubly-connected  edge  list  T>. 

Output.  A  partitioning  of  CP  into  monotone  subpolygons,  stored  in  CD. 

1 .  Construct  a  priority  queue  Q  on  the  vertices  of  CP,  using  their  v-coordinates 
as  priority.  If  two  points  have  the  same  v-coordinate,  the  one  with  smaller 
v-coordinate  has  higher  priority. 

2.  Initialize  an  empty  binary  search  tree  T. 

3.  while  Q  is  not  empty 

4.  do  Remove  the  vertex  v/  with  the  highest  priority  from  Q. 

5.  Call  the  appropriate  procedure  to  handle  the  vertex,  depending  on 
its  type. 

We  next  describe  more  precisely  how  to  handle  the  event  points.  You  should  first 
read  these  algorithms  without  thinking  about  degenerate  cases,  and  check  only 
later  that  they  are  also  correct  in  degenerate  cases.  (To  this  end  you  should  give 
an  appropriate  meaning  to  “directly  left  of’  in  line  1  of  HandleSplitVertex 
and  line  2  of  HandleMerge Vertex.)  There  are  always  two  things  we  must 
do  when  we  handle  a  vertex.  First,  we  must  check  whether  we  have  to  add  a 
diagonal.  This  is  always  the  case  for  a  split  vertex,  and  also  when  we  replace 
the  helper  of  an  edge  and  the  previous  helper  was  a  merge  vertex.  Second,  we 
must  update  the  information  in  the  status  structure  7.  The  precise  algorithms 
for  each  type  of  event  are  given  below.  You  can  use  the  example  figure  on  the 
next  page  to  see  what  happens  in  each  of  the  different  cases. 

Handles  tart  Vertex(v,) 

1.  Insert  e,-  in  7  and  set  helper(ei)  to  v,-. 

At  the  start  vertex  V5  in  the  example  figure,  for  instance,  we  insert  (’s  into  the 
tree  T. 

HandleEndVertex(v;) 

1.  if  helper(ej-i)  is  a  merge  vertex 

2.  then  Insert  the  diagonal  connecting  v,  to  helper(e,--  \ )  in  CD. 

3.  Delete  e(_i  from  T. 

In  the  running  example,  when  we  reach  end  vertex  V15,  the  helper  of  the  edge 
<2 14  is  v'14.  t'i4  is  not  a  merge  vertex,  so  we  don’t  need  to  insert  a  diagonal.  53 
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HandleSplitVertex(v,) 

1.  Search  in  7  to  find  the  edge  ej  directly  left  of  v,-. 

2.  Insert  the  diagonal  connecting  v,-  to  helper(ej)  in  CD. 

3.  helper(ej)  <—  v,- 

4.  Insert  e,-  in  7  and  set  helper(ei)  to  v,-. 

For  split  vertex  V14  in  our  example,  eg  is  the  edge  to  the  left.  Its  helper  is  vs,  so 
we  add  a  diagonal  from  V14  to  vs- 

H  andleMerge  Vertex(v,) 


1. 

if  helper(ej-i)  is  a  merge  vertex 

e\J 

2. 

then  Insert  the  diagonal  connecting  v,  to  helper(ej  .  \ )  in  CD. 

3. 

Delete  <?,_  i  from  T. 

j 

4. 

Search  in  7  to  find  the  edge  ej  directly  left  of  v,. 

V15 

5. 

if  helper(ej)  is  a  merge  vertex 

6. 

then  Insert  the  diagonal  connecting  v,  to  helper{ef)  in  CD. 

7. 

helper(ej)  <—  V; 

For  the  merge  vertex  vg  in  our  example,  the  helper  V2  of  edge  ej  is  a  merge 
vertex,  so  we  add  a  diagonal  from  vg  to  V2- 

The  only  routine  that  remains  to  be  described  is  the  one  to  handle  a  regular 
vertex.  The  actions  we  must  take  at  a  regular  vertex  depend  on  whether  CP  lies 
locally  to  its  left  or  to  its  right. 


HandleRegularVertex(v,) 

1 .  if  the  interior  of  CP  lies  to  the  right  of  v,- 

2.  then  if  helper(e,-  \ )  is  a  merge  vertex 

3.  then  Insert  the  diagonal  connecting  v;  to  helper(ei-i)  in  CD. 

4.  Delete  e,-_  i  from  T. 

5.  Insert  e,-  in  CT  and  set  helper(ei)  to  v,-. 

6.  else  Search  in  O'  to  find  the  edge  e  j  directly  left  of  v,-. 

7.  if  helper(ej)  is  a  merge  vertex 

8.  then  Insert  the  diagonal  connecting  v,-  to  helper[ef)  in  CD. 

9.  helper(ej)  <—  v;,- 

For  instance,  at  the  regular  vertex  vg  in  our  example,  we  add  a  diagonal  from  vg 
to  V'4. 


It  remains  to  prove  that  MakeMonotone  correctly  partitions  CP  into  monotone 
pieces. 

Lemma  3.5  Algorithm  MakeMonotone  adds  a  set  of  non-intersecting  diag¬ 
onals  that  partitions  CP  into  monotone  subpolygons. 

Proof.  It  is  easy  to  see  that  the  pieces  into  which  CP  is  partitioned  contain  no 
split  or  merge  vertices.  Hence,  they  are  monotone  by  Lemma  3.4.  It  remains 
to  prove  that  the  added  segments  are  valid  diagonals  (that  is,  that  they  don’t 
intersect  the  edges  of  CP)  and  that  they  don’t  intersect  each  other.  To  this 
end  we  will  show  that  when  a  segment  is  added,  it  intersects  neither  an  edge 


of  CP  nor  any  of  the  previously  added  segments.  We  shall  prove  this  for  the 
segment  added  in  HandleSplitVertex;  the  proof  for  the  segments  added 
inHANDLEENDVERTEX,  HandleRegular Vertex,  and  HandleMerge- 
Vertex  is  similar.  We  assume  that  no  two  vertices  have  the  same  y-coordinate; 
the  extension  to  the  general  case  is  fairly  straightforward. 

Consider  a  segment  vmv(-  that  is  added  by  HandleSplitVertex  when  v;,- 
is  reached.  Let  ej  be  the  edge  to  the  left  of  v,-,  and  let  et  be  the  edge  to  the  right 
of  V,-.  Thus  helper(ej)  =  vm  when  we  reach  v,-. 

We  first  argue  that  v,„Vj  does  not  intersect  an  edge  of  CP.  To  see  this,  consider 
the  quadrilateral  Q  bounded  by  the  horizontal  lines  through  vm  and  v,-,  and  by 
ej  and  c/,.  There  are  no  vertices  of  CP  inside  Q,  otherwise  vm  would  not  be  the 
helper  of  e Now  suppose  there  would  be  an  edge  of  CP  intersecting  V^vj.  Since 
the  edge  cannot  have  an  endpoint  inside  Q  and  polygon  edges  do  not  intersect 
each  other,  it  would  have  to  intersect  the  horizontal  segment  connecting  vm  to 
e.  j  or  the  horizontal  segment  connecting  v,-  to  e  j.  Both  are  impossible,  since  for 
both  vm  and  v/,  the  edge  ej  lies  immediately  to  the  left.  Hence,  no  edge  of  CP  can 
intersect  V^Vj. 

Now  consider  a  previously  added  diagonal.  Since  there  are  no  vertices  of  CP 
inside  Q,  and  any  previously  added  diagonal  must  have  both  of  its  endpoints 
above  v/,  it  cannot  intersect  vmv,-.  0 
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We  now  analyze  the  running  time  of  the  algorithm.  Constructing  the  priority 
queue  Q  takes  linear  time  and  initializing  T  takes  constant  time.  To  handle  an 
event  during  the  sweep,  we  perform  one  operation  on  Q,  at  most  one  query, 
one  insertion,  and  one  deletion  on  T,  and  we  insert  at  most  two  diagonals  into 
CD.  Priority  queues  and  balanced  search  trees  allow  for  queries  and  updates  in 
Ollogn)  time,  and  the  insertion  of  a  diagonal  into  CD  takes  0(1)  time.  Hence, 
handling  an  event  takes  Ollogn)  time,  and  the  total  algorithm  runs  in  O(nlogn) 
time.  The  amount  of  storage  used  by  the  algorithm  is  clearly  linear:  every  vertex 
is  stored  at  most  once  in  Q,  and  every  edge  is  stored  at  most  once  in  T.  Together 
with  Lemma  3.5  this  implies  the  following  theorem. 

Theorem  3.6  A  simple  polygon  with  n  vertices  can  be  partitioned  into  y- 
monotone  polygons  in  O(nlogn)  time  with  an  algorithm  that  uses  0(n)  storage. 


3.3  Triangulating  a  Monotone  Polygon 

We  have  just  seen  how  to  partition  a  simple  polygon  into  y-monotone  pieces 
in  O(nlogn)  time.  In  itself  this  is  not  very  interesting.  But  in  this  section  we 
show  that  monotone  polygons  can  be  triangulated  in  linear  time.  Together  these 
results  imply  that  any  simple  polygon  can  be  triangulated  in  O(nlogn)  time, 
a  nice  improvement  over  the  quadratic  time  algorithm  that  we  sketched  at  the 
beginning  of  the  previous  section. 

Let  CP  be  a  y-monotone  polygon  with  n  vertices.  Lor  the  moment  we  assume  _ 

that  CP  is  strictly  y-monotone,  that  is,  we  assume  that  CP  is  y-monotone  and  does  55 


Chapter  3  not  contain  horizontal  edges.  Thus  we  always  go  down  when  we  walk  on  the 

polygon  triangulation  left  or  right  boundary  chain  of  7  from  the  highest  vertex  of  CP  to  the  lowest 

one.  This  is  the  property  that  makes  triangulating  a  monotone  polygon  easy: 
we  can  work  our  way  through  CP  from  top  to  bottom  on  both  chains,  adding 
diagonals  whenever  this  is  possible.  Next  we  describe  the  details  of  this  greedy 
triangulation  algorithm. 


\  triangulated 
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The  algorithm  handles  the  vertices  in  order  of  decreasing  y-coordinate.  If  two 
vertices  have  the  same  y-coordinate,  then  the  leftmost  one  is  handled  first.  The 
algorithm  requires  a  stack  §  as  auxiliary  data  structure.  Initially  the  stack  is 
empty;  later  it  contains  the  vertices  of  CP  that  have  been  encountered  but  may 
still  need  more  diagonals.  When  we  handle  a  vertex  we  add  as  many  diagonals 
from  this  vertex  to  vertices  on  the  stack  as  possible.  These  diagonals  split  off 
triangles  from  CP.  The  vertices  that  have  been  handled  but  not  split  off — the 
vertices  on  the  stack — are  on  the  boundary  of  the  part  of  CP  that  still  needs  to  be 
triangulated.  The  lowest  of  these  vertices,  which  is  the  one  encountered  last,  is 
on  top  of  the  stack,  the  second  lowest  is  second  on  the  stack,  and  so  on.  The 
part  of  CP  that  still  needs  to  be  triangulated,  and  lies  above  the  last  vertex  that 
has  been  encountered  so  far,  has  a  particular  shape:  it  looks  like  a  funnel  turned 
upside  down.  One  boundary  of  the  funnel  consists  of  a  part  of  a  single  edge  of 
CP,  and  the  other  boundary  is  a  chain  consisting  of  reflex  vertices,  that  is,  the 
interior  angle  at  these  vertices  is  at  least  180°.  Only  the  highest  vertex,  which  is 
at  the  bottom  of  the  stack,  is  convex.  This  property  remains  true  after  we  have 
handled  the  next  vertex.  Hence,  it  is  an  invariant  of  the  algorithm. 

Now,  let’s  see  which  diagonals  we  can  add  when  we  handle  the  next  vertex. 
We  distinguish  two  cases:  v,-,  the  next  vertex  to  be  handled,  lies  on  the  same 
chain  as  the  reflex  vertices  on  the  stack,  or  it  lies  on  the  opposite  chain.  If  Vj 
lies  on  the  opposite  chain,  it  must  be  the  lower  endpoint  of  the  single  edge  e 
bounding  the  funnel.  Due  to  the  shape  of  the  funnel,  we  can  add  diagonals  from 
Vj  to  all  vertices  currently  on  the  stack,  except  for  the  last  one  (that  is,  the  one 
at  the  bottom  of  the  stack);  the  last  vertex  on  the  stack  is  the  upper  vertex  of  e, 
so  it  is  already  connected  to  vj.  All  these  vertices  are  popped  from  the  stack. 
The  untriangulated  part  of  the  polygon  above  vj  is  bounded  by  the  diagonal 
that  connects  vj  to  the  vertex  previously  on  top  of  the  stack  and  the  edge  of  7 
extending  downward  from  this  vertex,  so  it  looks  like  a  funnel  and  the  invariant 
is  preserved.  This  vertex  and  vj  remain  part  of  the  not  yet  triangulated  polygon, 
so  they  are  pushed  onto  the  stack. 

The  other  case  is  when  vj  is  on  the  same  chain  as  the  reflex  vertices  on  the 
stack.  This  time  we  may  not  be  able  to  draw  diagonals  from  vj  to  all  vertices  on 
the  stack.  Nevertheless,  the  ones  to  which  we  can  connect  vj  are  all  consecutive 
and  they  are  on  top  of  the  stack,  so  we  can  proceed  as  follows.  First,  pop  one 
vertex  from  the  stack;  this  vertex  is  already  connected  to  v  ,■  by  an  edge  of  7. 
Next,  pop  vertices  from  the  stack  and  connect  them  to  vj  until  we  encounter 
one  where  this  is  not  possible.  Checking  whether  a  diagonal  can  be  drawn  from 
Vj  to  a  vertex  vj.  on  the  stack  can  be  done  by  looking  at  Vj,  v*,  and  the  previous 
vertex  that  was  popped.  When  we  find  a  vertex  to  which  we  cannot  connect  Vj, 
we  push  the  last  vertex  that  has  been  popped  back  onto  the  stack.  This  is  either 


the  last  vertex  to  which  a  diagonal  was  added  or,  if  no  diagonals  have  been 
added,  it  is  the  neighbor  of  vj  on  the  boundary  of  CP — see  Figure  3.5.  After  this 

popped  and 
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TRIANGULATING  A  MONOTONE 
POLYGON 


Figure  3.5 

Two  cases  when  the  next  vertex  is  on 
the  same  side  as  the  reflex  vertices  on 
the  stack 


has  been  done  we  push  Vj  onto  the  stack.  In  both  cases  the  invariant  is  restored: 
one  side  of  the  funnel  is  bounded  by  a  part  of  a  single  edge,  and  the  other  side 
is  bounded  by  a  chain  of  reflex  vertices.  We  get  the  following  algorithm.  (The 
algorithm  is  actually  similar  to  the  convex  hull  algorithm  of  Chapter  1 .) 

Algorithm  TriangulateMonotonePolygon(CP) 

Input.  A  strictly  v-monotone  polygon  CP  stored  in  a  doubly-connected  edge 
list  CD. 

Output.  A  triangulation  of  CP  stored  in  the  doubly-connected  edge  list  CD. 

1 .  Merge  the  vertices  on  the  left  chain  and  the  vertices  on  the  right  chain  of  CP 
into  one  sequence,  sorted  on  decreasing  v-coordinate.  If  two  vertices  have 
the  same  y-coordinate,  then  the  leftmost  one  comes  first.  Let  u\ , . . . ,  un 
denote  the  sorted  sequence. 

2.  Initialize  an  empty  stack  §,  and  push  u\  and  U2  onto  it. 

3.  for  j  4—  3  to  n  —  1 

4.  do  if  u  j  and  the  vertex  on  top  of  8  are  on  different  chains 

5.  then  Pop  all  vertices  from  8. 

6.  Insert  into  CD  a  diagonal  from  uj  to  each  popped  vertex, 
except  the  last  one. 

7.  Push  Uj- 1  and  uj  onto  §. 

8.  else  Pop  one  vertex  from  8. 

9.  Pop  the  other  vertices  from  8  as  long  as  the  diagonals  from 
Uj  to  them  are  inside  CP.  Insert  these  diagonals  into  CD.  Push 
the  last  vertex  that  has  been  popped  back  onto  8. 

10.  Push  Uj  onto  8. 

1 1 .  Add  diagonals  from  u„  to  all  stack  vertices  except  the  first  and  the  last  one. 

How  much  time  does  the  algorithm  take?  Step  1  takes  linear  time  and  Step  2 
takes  constant  time.  The  for-loop  is  executed  n  —  3  times,  and  one  execution 
may  take  linear  time.  But  at  every  execution  of  the  for-loop  at  most  two  vertices 
are  pushed.  Hence,  the  total  number  of  pushes,  including  the  two  in  Step  2,  is 
bounded  by  2 n  —  4.  Because  the  number  of  pops  cannot  exceed  the  number  of 
pushes,  the  total  time  for  all  executions  of  the  for-loop  is  O(n).  The  last  step  of 
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Chapter  3  the  algorithm  also  takes  at  most  linear  time,  so  the  total  algorithm  runs  in  0(n) 

POLYGON  TRIANGULATION  time. 
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Theorem  3.7  A  strictly  y-monotone  polygon  with  n  vertices  can  be  triangulated 
in  linear  time. 

We  wanted  a  triangulation  algorithm  for  monotone  polygons  as  a  subroutine 
for  triangulating  arbitrary  simple  polygons.  The  idea  was  to  first  decompose 
a  polygon  into  monotone  pieces  and  then  to  triangulate  these  pieces.  It  seems 
that  we  have  all  the  ingredients  we  need.  There  is  one  problem,  however:  in 
this  section  we  have  assumed  that  the  input  is  a  strictly  y-monotone  polygon, 
whereas  the  algorithm  of  the  previous  section  may  produce  monotone  pieces 
with  horizontal  edges.  Recall  that  in  the  previous  section  we  treated  vertices 
with  the  same  y-coordinates  from  left  to  right.  This  had  the  same  effect  as  a 
slight  rotation  of  the  plane  in  clockwise  direction  such  that  no  two  vertices 
are  on  a  horizontal  line.  It  follows  that  the  monotone  subpolygons  produced 
by  the  algorithm  of  the  previous  section  are  strictly  monotone  in  this  slightly 
rotated  plane.  Hence,  the  triangulation  algorithm  of  the  current  section  operates 
correctly  if  we  treat  vertices  with  the  same  y-coordinate  from  left  to  right 
(which  corresponds  to  working  in  the  rotated  plane).  So  we  can  combine  the 
two  algorithms  to  obtain  a  triangulation  algorithm  that  works  for  any  simple 
polygon. 

How  much  time  does  the  triangulation  algorithm  take?  Decomposing  the 
polygon  into  monotone  pieces  takes  O(nlogn)  time  by  Theorem  3.6.  In  the 
second  stage  we  triangulate  each  of  the  monotone  pieces  with  the  linear-time 
algorithm  of  this  section.  Since  the  sum  of  the  number  of  vertices  of  the  pieces 
is  0(n ),  the  second  stage  takes  0(n)  time  in  total.  We  get  the  following  result. 

Theorem  3.8  A  simple  polygon  with  n  vertices  can  be  triangulated  in  0(n  log  n) 
time  with  an  algorithm  that  uses  O(n)  storage. 

We  have  seen  how  to  triangulate  simple  polygons.  But  what  about  polygons 
with  holes,  can  they  also  be  triangulated  easily?  The  answer  is  yes.  In  fact, 
the  algorithm  we  have  seen  also  works  for  polygons  with  holes:  nowhere  in 
the  algorithm  for  splitting  a  polygon  into  monotone  pieces  did  we  use  the  fact 
that  the  polygon  was  simple.  It  even  works  in  a  more  general  setting:  Suppose 
we  have  a  planar  subdivision  S  and  we  want  to  triangulate  that  subdivision. 
More  precisely,  if  B  is  a  bounding  box  containing  all  edges  of  S  in  its  interior, 
we  want  to  find  a  maximal  set  of  non-intersecting  diagonals — line  segments 
connecting  vertices  of  S  or  B  that  do  not  intersect  the  edges  of  S — that  partitions 
B  into  triangles.  Figure  3.6  shows  a  triangulated  subdivision.  The  edges  of 
the  subdivisions  and  of  the  bounding  box  are  shown  bold.  To  compute  such  a 
triangulation  we  can  use  the  algorithm  of  this  chapter:  first  split  the  subdivision 
into  monotone  pieces,  and  then  triangulate  the  pieces.  This  leads  to  the  following 
theorem. 

Theorem  3.9  A  planar  subdivision  with  n  vertices  in  total  can  be  triangulated 
in  O(nlogn)  time  with  an  algorithm  that  uses  O(n)  storage. 
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Figure  3.6 

A  triangulated  subdivision 


3.4  Notes  and  Comments 

The  Art  Gallery  Problem  was  posed  in  1973  by  Victor  Klee  in  a  conversation 
with  Vasek  Chvatal.  In  1975  Chvatal  [128]  gave  the  first  proof  that  |  n/3j 
cameras  are  always  sufficient  and  sometimes  necessary;  a  result  that  became 
known  as  the  Art  Gallery  Theorem  or  the  Watchman  Theorem.  Chvatal’s  proof 
is  quite  complicated.  The  much  simpler  proof  presented  in  this  chapter  was 
discovered  by  Fisk  [178].  His  proof  is  based  on  the  Two  Ears  Theorem  by 
Meisters  [277],  from  which  the  3-colorability  of  the  graph  that  is  a  triangulation 
of  a  simple  polygon  follows  easily.  The  algorithmic  problem  of  finding  the 
minimum  number  of  guards  for  a  given  simple  polygon  was  shown  to  be  NP- 
hard  by  Aggarwal  [10]  and  Lee  and  Lin  [246].  The  book  by  O’Rourke  [298] 
and  the  overview  by  Shermer  [355]  contain  an  extensive  treatment  of  the  Art 
Gallery  Problem  and  numerous  variations. 

A  decomposition  of  a  polygon,  or  any  other  region,  into  simple  pieces  is  useful  in 
many  problems.  Often  the  simple  pieces  are  triangles,  in  which  case  we  call  the 
decomposition  a  triangulation,  but  sometimes  other  shapes  such  as  quadrilaterals 
or  trapezoids  are  used — see  also  Chapters  6,  9,  and  14.  We  only  discuss  the 
results  on  triangulating  polygons  here.  The  linear  time  algorithm  to  triangulate  a 
monotone  polygon  described  in  this  chapter  was  given  by  Garey  et  al.  [188],  and 
the  plane  sweep  algorithm  to  partition  a  polygon  into  monotone  pieces  is  due  to 
Lee  and  Preparata  [250].  Avis  and  Toussaint  [32]  and  Chazelle  [85]  described 
other  algorithms  for  triangulating  a  simple  polygon  in  0{n\ogn)  time. 

For  a  long  time  one  of  the  main  open  problems  in  computational  geome¬ 
try  was  whether  simple  polygons  can  be  triangulated  in  o(n  log «)  time.  (For 
triangulating  subdivisions  with  holes  there  is  an  Tl(nlogn)  lower  bound.)  In 
this  chapter  we  have  seen  that  this  is  indeed  the  case  for  monotone  polygons. 
Linear-time  triangulation  algorithms  were  also  found  for  other  special  classes  of 
polygons  [108,  109,  170,  184,  214]  but  the  problem  for  general  simple  polygons 
remained  open  for  a  number  of  years.  In  1988  Tarjan  and  Van  Wyk  [368]  broke 
the  O(nlogn)  barrier  by  presenting  an  O(nloglogn)  algorithm.  Their  algorithm 
was  later  simplified  by  Kirkpatrick  et  al.  [237].  Randomization — an  approach 
used  in  Chapters  4,  6,  9,  and  1 1 — proved  to  be  a  good  tool  in  developing  even 


Chapter  3  faster  algorithms:  Clarkson  et  al.  [134],  Devillers  [141],  and  Seidel  [345]  pre- 

polygon  triangulation  sented  algorithms  with  0(n\og*  n)  running  time,  where  log'  n  is  the  iterated 

logarithm  of  n  (being  the  number  of  times  you  can  take  the  logarithm  before  the 
result  is  smaller  than  1).  These  algorithms  are  not  only  slightly  faster  than  the 
O(n\og\ogn)  algorithm,  but  also  simpler.  Seidel’s  algorithm  is  closely  related 
to  the  algorithm  for  constructing  a  trapezoidal  decomposition  of  a  planar  subdi¬ 
vision  described  in  Chapter  6.  However,  the  question  whether  a  simple  polygon 
can  be  triangulated  in  linear  time  was  still  open.  In  1990  this  problem  was  finally 
settled  by  Chazelle  [92,  94],  who  gave  a  (quite  complicated)  deterministic  linear 
time  algorithm.  A  randomized  linear  time  algorithm  was  developed  later  by 
Amato  et  al.  [15]. 

The  3-dimensional  equivalent  to  the  polygon  triangulation  problem  is  this:  de¬ 
compose  a  given  polytope  into  non-overlapping  tetrahedra,  where  the  vertices  of 
the  tetrahedra  must  be  vertices  of  the  original  polytope.  Such  a  decomposition  is 
called  a  tetrahedralization  of  the  polytope.  This  problem  is  much  more  difficult 
than  the  two-dimensional  version.  In  fact,  it  is  not  always  possible  to  decompose 
a  polytope  into  tetrahedra  without  using  additional  vertices.  Chazelle  [86]  has 
shown  that  for  a  simple  polytope  with  n  vertices,  ®(n2)  additional  vertices  may 
be  needed  and  are  always  sufficient  to  obtain  a  decomposition  into  tetrahedra. 
This  bound  was  refined  by  Chazelle  and  Palios  [110]  to  0(n  +  r2),  where  r  is  the 
number  of  reflex  edges  of  the  polytope.  The  algorithm  to  compute  the  decompo¬ 
sition  runs  in  0(nr  +  r2\ogr)  time.  Deciding  whether  a  given  simple  polytope 
can  be  tetrahedralized  without  additional  vertices  is  NP-complete  [330], 
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3.5  Exercises 

3.1  Prove  that  any  polygon  admits  a  triangulation,  even  if  it  has  holes.  Can 
you  say  anything  about  the  number  of  triangles  in  the  triangulation? 

3.2  A  rectilinear  polygon  is  a  simple  polygon  of  which  all  edges  are  horizontal 
or  vertical.  Let  7  be  a  rectilinear  polygon  with  n  vertices.  Give  an  example 
to  show  that  [n /4J  cameras  are  sometimes  necessary  to  guard  it. 

3.3  Prove  or  disprove:  The  dual  graph  of  the  triangulation  of  a  monotone 
polygon  is  always  a  chain,  that  is,  any  node  in  this  graph  has  degree  at 
most  two. 

3.4  Suppose  that  a  simple  polygon  7  with  n  vertices  is  given,  together  with  a 
set  of  diagonals  that  partitions  7  into  convex  quadrilaterals.  How  many 
cameras  are  sufficient  to  guard  71  Why  doesn’t  this  contradict  the  Art 
Gallery  Theorem? 

3.5  Give  the  pseudo-code  of  the  algorithm  to  compute  a  3-coloring  of  a 
triangulated  simple  polygon.  The  algorithm  should  run  in  linear  time. 


3.6  Give  an  algorithm  that  computes  in  O(nlogn)  time  a  diagonal  that  splits  Section  3.5 
a  simple  polygon  with  n  vertices  into  two  simple  polygons  each  with  at  exercises 
most  |J2n/3j  +2  vertices.  Hint:  Use  the  dual  graph  of  a  triangulation. 

3.7  Let  (P  be  a  simple  polygon  with  n  vertices,  which  has  been  partitioned 
into  monotone  pieces.  Prove  that  the  sum  of  the  number  of  vertices  of  the 
pieces  is  0(n). 

3.8  The  algorithm  given  in  this  chapter  to  partition  a  simple  polygon  into 
monotone  pieces  constructs  a  doubly-connected  edge  list  for  the  parti¬ 
tioned  polygon.  During  the  algorithm,  new  edges  are  added  to  the  DCEL 
(namely,  diagonals  to  get  rid  of  split  and  merge  vertices).  In  general, 
adding  an  edge  to  a  DCEL  cannot  be  done  in  constant  time.  Discuss 
why  adding  an  edge  may  take  more  than  constant  time,  and  argue  that  in 
the  polygon-partitioning  algorithm  we  can  add  a  diagonal  in  0(1)  time 
nevertheless. 

3.9  Show  that  if  a  polygon  has  0(1)  turn  vertices,  then  the  algorithm  given  in 
this  chapter  can  be  made  to  run  in  0(n)  time. 

3.10  Can  the  algorithm  of  this  chapter  also  be  used  to  triangulate  a  set  of  n 
points?  If  so,  explain  how  to  do  this  efficiently. 

3.11  Give  an  efficient  algorithm  to  determine  whether  a  polygon  IP  with  n 

vertices  is  monotone  with  respect  to  some  line,  not  necessarily  a  horizontal 

.  .  pockets 

or  vertical  one. 

3.12  The  pockets  of  a  simple  polygon  are  the  areas  outside  the  polygon,  but 
inside  its  convex  hull.  Let  (Pi  be  a  simple  polygon  with  m  vertices,  and 
assume  that  a  triangulation  of  (Pi  as  well  as  its  pockets  is  given.  Let  (Pi 
be  a  convex  polygon  with  n  vertices.  Show  that  the  intersection  (Pj  n(Pi 
can  be  computed  in  0(m  +  n)  time. 

3.13  The  stabbing  number  of  a  triangulated  simple  polygon  (P  is  the  maximum 
number  of  diagonals  intersected  by  any  line  segment  interior  to  (P.  Give 
an  algorithm  that  computes  a  triangulation  of  a  convex  polygon  that  has 
stabbing  number  0(log«). 

3.14  Given  a  simple  polygon  (P  with  n  vertices  and  a  point  p  inside  it,  show 
how  to  compute  the  region  inside  (P  that  is  visible  from  p. 
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4  Linear  Programming 

Manufacturing  with  Molds 


Most  objects  we  see  around  us  today — from  car  bodies  to  plastic  cups  and 
cutlery — are  made  using  some  form  of  automated  manufacturing.  Computers 
play  an  important  role  in  this  process,  both  in  the  design  phase  and  in  the 
construction  phase;  CAD/CAM  facilities  are  a  vital  part  of  any  modem  factory. 
The  construction  process  used  to  manufacture  a  specific  object  depends  on 
factors  such  as  the  material  the  object  should  be  made  of,  the  shape  of  the  object, 
and  whether  the  object  will  be  mass  produced.  In  this  chapter  we  study  some 
geometric  aspects  of  manufacturing  with  molds,  a  commonly  used  process  for 
plastic  or  metal  objects.  For  metal  objects  this  process  is  often  referred  to  as 
casting. 


Figure  4. 1 

The  casting  process 


Figure  4.1  illustrates  the  casting  process:  liquid  metal  is  poured  into  a  mold,  it 
solidifies,  and  then  the  object  is  removed  from  the  mold.  The  last  step  is  not 
always  as  easy  as  it  seems;  the  object  could  be  stuck  in  the  mold,  so  that  it 
cannot  be  removed  without  breaking  the  mold.  Sometimes  we  can  get  around 
this  problem  by  using  a  different  mold.  There  are  also  objects,  however,  for 
which  no  good  mold  exists;  a  sphere  is  an  example.  This  is  the  problem  we 
shall  study  in  this  chapter:  given  an  object,  is  there  a  mold  for  it  from  which  it 
can  be  removed? 

We  shall  confine  ourselves  to  the  following  situation.  First  of  all,  we  assume 
that  the  object  to  be  constructed  is  polyhedral.  Secondly,  we  only  consider 
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Chapter  4  molds  of  one  piece,  not  molds  consisting  of  two  or  more  pieces.  (Using  molds 

linear  programming  consisting  of  two  pieces,  it  is  possible  to  manufacture  objects  such  as  spheres, 

which  cannot  be  manufactured  using  a  mold  of  a  single  piece.)  Finally,  we 
only  allow  the  object  to  be  removed  from  the  mold  by  a  single  translation.  This 
means  that  we  will  not  be  able  to  remove  a  screw  from  its  mold.  Fortunately, 
translational  motions  suffice  for  many  objects. 
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4.1  The  Geometry  of  Casting 

If  we  want  to  determine  whether  an  object  can  be  manufactured  by  casting, 
we  have  to  find  a  suitable  mold  for  it.  The  shape  of  the  cavity  in  the  mold  is 
determined  by  the  shape  of  the  object,  but  different  orientations  of  the  object 
give  rise  to  different  molds.  Choosing  the  orientation  can  be  crucial:  some 
orientations  may  give  rise  to  molds  from  which  the  object  cannot  be  removed, 
while  other  orientations  allow  removal  of  the  object.  One  obvious  restriction  on 
the  orientation  is  that  the  object  must  have  a  horizontal  top  facet.  This  facet  will 
be  the  only  one  not  in  contact  with  the  mold.  Hence,  there  are  as  many  potential 
orientations — or,  equivalently,  possible  molds — as  the  object  has  facets.  We  call 
an  object  castable  if  it  can  be  removed  from  its  mold  for  at  least  one  of  these 
orientations.  In  the  following  we  shall  concentrate  on  determining  whether  an 
object  is  removable  by  a  translation  from  a  specific  given  mold.  To  decide  on 
the  castability  of  the  object  we  then  simply  try  every  potential  orientation. 

Let  (P  be  a  3-dimensional  polyhedron — that  is,  a  3-dimensional  solid  bounded 
by  planar  facets — with  a  designated  top  facet.  (We  shall  not  try  to  give  a  precise, 
formal  definition  of  a  polyhedron.  Giving  such  a  definition  is  tricky  and  not 
necessary  in  this  context.)  We  assume  that  the  mold  is  a  rectangular  block 
with  a  cavity  that  corresponds  exactly  to  IP.  When  the  polyhedron  is  placed  in 
the  mold,  its  top  facet  should  be  coplanar  with  the  topmost  facet  of  the  mold, 
which  we  assume  to  be  parallel  to  the  xy-plane.  This  means  that  the  mold  has 
no  unnecessary  parts  sticking  out  on  the  top  that  might  prevent  IP  from  being 
removed. 

We  call  a  facet  of  (P  that  is  not  the  top  facet  an  ordinary  facet.  Every  ordinary 
facet  /  has  a  corresponding  facet  in  the  mold,  which  we  denote  by  /. 

We  want  to  decide  whether  IP  can  be  removed  from  its  mold  by  a  single  transla¬ 
tion.  In  other  words,  we  want  to  decide  whether  a  direction  d  exists  such  that 
7  can  be  translated  to  infinity  in  direction  d  without  intersecting  the  interior 
of  the  mold  during  the  translation.  Note  that  we  allow  7  to  slide  along  the 
mold.  Because  the  facet  of  7  not  touching  the  mold  is  its  top  facet,  the  removal 
direction  has  to  be  upward,  that  is,  it  must  have  a  positive  ^-component.  This  is 
only  a  necessary  condition  on  the  removal  direction;  we  need  more  constraints 
to  be  sure  that  a  direction  is  valid. 

Let  /  be  an  ordinary  facet  of  7.  This  facet  must  move  away  from,  or  slide 
along,  its  corresponding  facet  /  of  the  mold.  To  make  this  constraint  precise, 
we  need  to  define  the  angle  of  two  vectors  in  3-space.  We  do  this  as  follows. 


Take  the  plane  spanned  by  the  vectors  (we  assume  both  vectors  are  rooted  at 
the  origin);  the  angle  of  the  vectors  is  the  smaller  of  the  two  angles  measured  in 
this  plane.  Now  /  blocks  any  translation  in  a  direction  making  an  angle  of  less 
than  90°  with  rj(/),  the  outward  normal  of  /.  So  a  necessary  condition  on  d  is 
that  it  makes  an  angle  of  at  least  90°  with  the  outward  normal  of  every  ordinary 
facet  of  7.  The  next  lemma  shows  that  this  condition  is  also  sufficient. 

Lemma  4.1  The  polyhedron  7  can  be  removed  from  its  mold  by  a  translation 
in  direction  d  if  and  only  if  d  makes  an  angle  of  at  least  90°  with  the  outward 
normal  of  all  ordinary  facets  of  7. 

Proof.  The  “only  if”  part  is  easy:  if  d  made  an  angle  less  than  90°  with  some 
outward  normal  rj(/),  then  any  point  q  in  the  interior  of  /  collides  with  the 
mold  when  translated  in  direction  d. 

To  prove  the  “if”  part,  suppose  that  at  some  moment  7  collides  with  the 
mold  when  translated  in  direction  d.  We  have  to  show  that  there  must  be  an 
outward  normal  making  an  angle  of  less  than  90°  with  d.  Let  p  be  a  point  of 
7  that  collides  with  a  facet  /  of  the  mold.  This  means  that  p  is  about  to  move 
into  the  interior  of  the  mold,  so  rj  (/),  the  outward  normal  of  /,  must  make  an 
angle  greater  than  90°  with  d.  But  then  d  makes  an  angle  less  than  90°  with  the 
outward  normal  of  the  ordinary  facet  f  of  7  that  corresponds  to  /.  0 

Lemma  4.1  has  an  interesting  consequence:  if  7  can  be  removed  by  a 
sequence  of  small  translations,  then  it  can  be  removed  by  a  single  translation. 
So  allowing  for  more  than  one  translation  does  not  help  in  removing  the  object 
from  its  mold. 

We  are  left  with  the  task  of  finding  a  direction  d  that  makes  an  angle  of  at 
least  90°  with  the  outward  normal  of  each  ordinary  facet  of  7.  A  direction  in 
3-dimensional  space  can  be  represented  by  a  vector  rooted  at  the  origin.  We 
already  know  that  we  can  restrict  our  attention  to  directions  with  a  positive 
z-component.  We  can  represent  all  such  directions  as  points  in  the  plane  z  =  1, 
where  the  point  (x,y,  1)  represents  the  direction  of  the  vector  (x,y,  1).  This  way 
every  point  in  the  plane  z  =  1  represents  a  unique  direction,  and  every  direction 
with  a  positive  z- value  is  represented  by  a  unique  point  in  that  plane. 

Lemma  4. 1  gives  necessary  and  sufficient  conditions  on  the  removal  direc¬ 
tion  d.  How  do  these  conditions  translate  into  our  plane  of  directions?  Let 
rj  =  (Tjj,Tjy,fjz)  be  the  outward  normal  of  an  ordinary  facet.  The  direction 
d  =  (dx.  dy.  1)  makes  an  angle  at  least  90°  with  rj  if  and  only  if  the  dot  product 
of  d  and  rj  is  non-positive.  Hence,  an  ordinary  facet  induces  a  constraint  of  the 
form 

Uxd\  T  tjydy  — |-  rjz  Z  0. 

This  inequality  describes  a  half-plane  on  the  plane  z  =  1,  that  is,  the  area  left  or 
the  area  right  of  a  line  on  the  plane.  (This  last  statement  is  not  true  for  horizontal 
facets,  which  have  rjx  =  rjv  =  0.  In  this  case  the  constraint  is  either  impossible 
to  satisfy  or  always  satisfied,  which  is  easy  to  test.)  Hence,  every  non -horizontal 
facet  of  7  defines  a  closed  half-plane  on  the  plane  z  =  1,  and  any  point  in  the 
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Chapter  4  common  intersection  of  these  half-planes  corresponds  to  a  direction  in  which  IP 
linear  programming  can  be  removed.  The  common  intersection  of  these  half-planes  may  be  empty; 
in  this  case  V  cannot  be  removed  from  the  given  mold. 


We  have  transformed  our  manufacturing  problem  to  a  purely  geometric  problem 
in  the  plane:  given  a  set  of  half-planes,  find  a  point  in  their  common  intersection 
or  decide  that  the  common  intersection  is  empty.  If  the  polyhedron  to  be 
manufactured  has  n  facets,  then  the  planar  problem  has  at  most  n—  1  half-planes 
(the  top  facet  does  not  induce  a  half-plane).  In  the  next  sections  we  will  see 
that  the  planar  problem  just  stated  can  be  solved  in  expected  linear  time — see 
Section  4.4,  where  also  the  meaning  of  “expected”  is  explained. 

Recall  that  the  geometric  problem  corresponds  to  testing  whether  IP  can  be 
removed  from  a  given  mold.  If  this  is  impossible,  there  can  still  be  other  molds, 
corresponding  to  different  choices  of  the  top  facet,  from  which  IP  is  removable. 
In  order  to  test  whether  IP  is  castable,  we  try  all  its  facets  as  top  facets.  This 
leads  to  the  following  result. 

Theorem  4.2  Let  IP  be  a  polyhedron  with  n  facets.  In  0{n2)  expected  time  and 
using  0(n)  storage  it  can  be  decided  whether  (P  is  castable.  Moreover,  if  IP  is 
castable,  a  mold  and  a  valid  direction  for  removing  (P  from  it  can  be  computed 
in  the  same  amount  of  time. 


4.2  Half-Plane  Intersection 

Let  H  =  {hi ,  /?2, . . . ,  h, ,}  be  a  set  of  linear  constraints  in  two  variables,  that  is, 
constraints  of  the  form 

ciiX  +  bjy  R  Ci, 

where  a,,  b,,  and  c,  are  constants  such  that  at  least  one  of  a,  and  b,  is  non-zero. 
Geometrically,  we  can  interpret  such  a  constraint  as  a  closed  half-plane  in  E2, 
bounded  by  the  line  aix+biy  =  Ci.  The  problem  we  consider  in  this  section  is  to 
find  the  set  of  all  points  (x,y)  G  R2  that  satisfy  all  n  constraints  at  the  same  time. 
In  other  words,  we  want  to  find  all  the  points  lying  in  the  common  intersection 
of  the  half-planes  in  H.  (In  the  previous  section  we  reduced  the  casting  problem 
to  finding  some  point  in  the  intersection  of  a  set  of  half-planes.  The  problem  we 
study  now  is  more  general.) 

The  shape  of  the  intersection  of  a  set  of  half-planes  is  easy  to  determine:  a 
half-plane  is  convex,  and  the  intersection  of  convex  sets  is  again  a  convex 
set,  so  the  intersection  of  a  set  of  half-planes  is  a  convex  region  in  the  plane. 
Every  point  on  the  intersection  boundary  must  lie  on  the  bounding  line  of  some 
half-plane.  Hence,  the  boundary  of  the  region  consists  of  edges  contained  in 
these  bounding  lines.  Since  the  intersection  is  convex,  every  bounding  line  can 
contribute  at  most  one  edge.  It  follows  that  the  intersection  of  n  half-planes 
is  a  convex  polygonal  region  bounded  by  at  most  n  edges.  Figure  4.2  shows 
a  few  examples  of  intersections  of  half-planes.  To  which  side  of  its  bounding 
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line  a  half-plane  lies  is  indicated  by  dark  shading  in  the  figure;  the  common 
intersection  is  shaded  lightly.  As  you  can  see  in  Figures  4.2  (ii)  and  (iii),  the 
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Figure  4.2 

Examples  of  the  intersection  of 
half-planes 


intersection  does  not  have  to  be  bounded.  The  intersection  can  also  degenerate 
to  a  line  segment  or  a  point,  as  in  (iv),  or  it  can  be  empty,  as  in  (v). 

We  give  a  rather  straightforward  divide-and-conquer  algorithm  to  compute  the 
intersection  of  a  set  of  n  half-planes.  It  is  based  on  a  routine  IntersectCon- 
VEXRegions  to  compute  the  intersection  of  two  convex  polygonal  regions.  We 
first  give  the  overall  algorithm. 

Algorithm  IntersectHalfplanes(//) 

Input.  A  set  H  of  n  half-planes  in  the  plane. 

Output.  The  convex  polygonal  region  C  :=  f)hc/i  h. 

1.  ifcard(H)  =  l 

2.  then  C  <—  the  unique  half-plane  h  a  // 

3.  else  Split  H  into  sets  H\  and  H2  of  size  \n/2\  and  \n/2\ . 

4.  Ci  <—  IntersectHalfplanes(7/i) 

5.  C2  <—  iNTERSECTHALFPLANESf/ii) 

6.  C  ^ —  Inters  ectC  on  vexRegions  (Cj ,  C>) 

What  remains  is  to  describe  the  procedure  INTERSECTCONVEXREGIONS.  But 
wait — didn’t  we  see  this  problem  before,  in  Chapter  2?  Indeed,  Corollary  2.7 
states  that  we  can  compute  the  intersection  of  two  polygons  in  0(n  log  n  + 
klogn)  time,  where  11  is  the  total  number  of  vertices  in  the  two  polygons.  We 
must  be  a  bit  careful  in  applying  this  result  to  our  problem,  because  the  regions 
we  have  can  be  unbounded,  or  degenerate  to  a  segment  or  a  point.  Hence, 
the  regions  are  not  necessarily  polygons.  But  it  is  not  difficult  to  modify  the 
algorithm  from  Chapter  2  so  that  it  still  works. 

Let’s  analyze  this  approach.  Assume  we  have  already  computed  the  two  regions 
Ci  and  C2  by  recursion.  Since  they  are  both  defined  by  at  most  n/ 2+1  half¬ 
planes,  they  both  have  at  most  n/ 2+1  edges.  The  algorithm  from  Chapter  2 
computes  their  overlay  in  time  0((n  +  k)  logn),  where  k  is  the  number  of 
intersection  points  between  edges  of  Ci  and  edges  of  C2.  What  is  kl  Look 
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at  an  intersection  point  v  between  an  edge  e\  of  Ci  and  an  edge  eo  of  C2.  No 
matter  how  e\  and  ej  intersect,  v  must  be  a  vertex  of  Ci  D  C2.  But  Ci  (~l  C2  is  the 
intersection  of  n  half-planes,  and  therefore  has  at  most  n  edges  and  vertices.  It 
follows  that  k  ^  n,  so  the  computation  of  the  intersection  of  C\  and  Cb  takes 
0(n  logn)  time. 

This  gives  the  following  recurrence  for  the  total  running  time: 

7»={°(1)’ 

yO(n\ogn) +2T(n/2),  if«>l. 

This  recurrence  solves  to  T(n)  =  0(n\og2 n). 

To  obtain  this  result  we  used  a  subroutine  for  computing  the  intersection  of 
two  arbitrary  polygons.  The  polygonal  regions  we  deal  with  in  INTERSECT- 
HALFPLANES  are  always  convex.  Can  we  use  this  to  develop  a  more  efficient 
algorithm?  The  answer  is  yes,  as  we  show  next.  We  will  assume  that  the  regions 
we  want  to  intersect  are  2-dimensional;  the  case  where  one  or  both  of  them  is  a 
segment  or  a  point  is  easier  and  left  as  an  exercise. 


&ieft(C)  =  hi,li4,h5 
right  (C)  ^2  ■  h  | 


First,  let’s  specify  more  precisely  how  we  represent  a  convex  polygonal  region 
C.  We  will  store  the  left  and  the  right  boundary  of  C  separately,  as  sorted  lists 
of  half-planes.  The  lists  are  sorted  in  the  order  in  which  the  bounding  lines  of 
the  half-planes  occur  when  the  (left  or  right)  boundary  is  traversed  from  top  to 
bottom.  We  denote  the  left  boundary  list  by  T|eft(C),  and  the  right  boundary 
list  by  Cright(C).  Vertices  are  not  stored  explicitly;  they  can  be  computed  by 
intersecting  consecutive  bounding  lines. 

To  simplify  the  description  of  the  algorithm,  we  shall  assume  that  there  are 
no  horizontal  edges.  (To  adapt  the  algorithm  to  deal  with  horizontal  edges,  one 
can  define  such  edges  to  belong  to  the  left  boundary  if  they  bound  C  from  above, 
and  to  the  right  boundary  if  they  bound  C  from  below.  With  this  convention 
only  a  few  adaptations  are  needed  to  the  algorithm  stated  below.) 


The  new  algorithm  is  a  plane  sweep  algorithm,  like  the  one  in  Chapter  2:  we 
move  a  sweep  line  downward  over  the  plane,  and  we  maintain  the  edges  of  C\ 
and  C2  intersecting  the  sweep  line.  Since  C 1  and  C2  are  convex,  there  are  at  most 
four  such  edges.  Hence,  there  is  no  need  to  store  these  edges  in  a  complicated 
data  structure;  instead  we  simply  have  pointers  left.edge_Cl ,  right _edge_Cl, 
left_edge_C2,  and  right_edge_C2  to  them.  If  the  sweep  line  does  not  intersect 
the  right  or  left  boundary  of  a  region,  then  the  corresponding  pointer  is  nil. 
Figure  4.3  illustrates  the  definitions. 

How  are  these  pointers  initialized?  Let  y  1  be  the  y-coordinate  of  the  topmost 
vertex  of  Ci;  if  C\  has  an  unbounded  edge  extending  upward  to  infinity  then 
we  define  y \  =  °°.  Define  V2  similarly  for  C2,  and  let  ystart  =  rninfy  1  .V2).  To 
compute  the  intersection  of  C 1  and  C2  we  can  restrict  our  attention  to  the  part 
of  the  plane  with  y-coordinate  less  than  or  equal  to  ystart.  Hence,  we  let  the 
sweep  line  start  at  ystart,  and  we  initialize  the  edges  left  .edge  .CL  right  jedgeJCl, 
left_edge_C2,  and  right _edge  J22  as  the  ones  intersecting  the  line  y  =  ystart. 
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Figure  4.3 

The  edges  maintained  by  the  sweep  line 
algorithm 


In  a  plane  sweep  algorithm  one  normally  also  needs  a  queue  to  store  the  events. 
In  our  case  the  events  are  the  points  where  edges  of  C\  or  of  C2  start  or  stop  to 
intersect  the  sweep  line.  This  implies  that  the  next  event  point,  which  determines 
the  next  edge  to  be  handled,  is  the  highest  of  the  lower  endpoints  of  the  edges 
intersecting  the  sweep  line.  (Endpoints  with  the  same  y-coordinate  are  handled 
from  left  to  right.  If  two  endpoints  coincide  then  the  leftmost  edge  is  treated 
first.)  Hence,  we  don’t  need  an  event  queue;  the  next  event  can  be  found  in 
constant  time  using  the  pointers  left-edge-Cl.  right-edge  .Cl ,  left  .edge  JC2,  and 
right  _edgeJC2. 

At  each  event  point  some  new  edge  e  appears  on  the  boundary.  To  handle 
the  edge  e  we  first  check  whether  e  belongs  to  C\  or  to  C2,  and  whether  it  is  on 
the  left  or  the  right  boundary,  and  then  call  the  appropriate  procedure.  We  shall 
only  describe  the  procedure  that  is  called  when  e  is  on  the  left  boundary  of  Cy. 
The  other  procedures  are  similar. 

Let  p  be  the  upper  endpoint  of  e.  The  procedure  that  handles  e  will  discover  three 
possible  edges  that  C  might  have:  the  edge  with  p  as  upper  endpoint,  the  edge 
with  e  n  left -edge -C2  as  upper  endpoint,  and  the  edge  with  e  D  right -edge  -C2 
as  upper  endpoint.  It  performs  the  following  actions. 

■  First  we  test  whether  p  lies  in  between  left_edge_C2  and  right _edge_C2.  If 
this  is  the  case,  then  e  contributes  an  edge  to  C  starting  at  p.  We  then  add 
the  half-plane  whose  bounding  line  contains  e  to  the  list  <C|eft(C). 

■  Next  we  test  whether  e  intersects  right -edge-C2.  If  this  is  the  case,  then  the 
intersection  point  is  a  vertex  of  C.  Either  both  edges  contribute  an  edge  to 
C  starting  at  the  intersection  point — this  happens  when  p  lies  to  the  right 
of  right _edge -C2,  as  in  Figure  4.4(i) — or  both  edges  contribute  an  edge 
ending  there — this  happens  when  p  lies  to  the  left  of  right  edge  C2,  as  in 
Figure  4.4(ii).  If  both  edges  contribute  an  edge  starting  at  the  intersection 
point,  then  we  have  to  add  the  half-plane  defining  e  to  £ieft(C)  and  the 
half-plane  defining  right _edge_C2  to  <Cnght(C).  If  they  contribute  an  edge 
ending  at  the  intersection  point  we  do  nothing;  these  edges  have  already 
been  discovered  in  some  other  way. 


■  Finally  we  test  whether  e  intersects  left  -edge  -C2.  If  this  is  the  case,  then  the 
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Figure  4.4 

The  two  possibilities  when  e  intersects 
right_edge_C2 


intersection  point  is  a  vertex  of  C.  The  edge  of  C  starting  at  that  vertex  is 
either  a  part  of  e  or  it  is  a  part  of  left-edge-C2.  We  can  decide  between  these 
possibilities  in  constant  time:  if  p  lies  to  the  left  of  left-edge  C2  then  it  is  a 
part  of  e,  otherwise  it  is  a  part  of  left_edgeJ22.  After  we  decided  whether  e 
or  left_edge_C2  contributes  the  edge  to  C,  we  add  the  appropriate  half-plane 
to  £<ieft(C). 

Notice  that  we  may  add  two  half-planes  to  T|eft(C):  the  half-plane  bounding  e 
and  the  half-plane  bounding  left-edge-C2.  In  which  order  should  we  add  them? 
We  add  left-edge -C2  only  if  it  defines  an  edge  of  C  starting  at  the  intersection 
point  of  left-edge-C2  and  e.  If  we  also  decide  to  add  the  half-plane  of  e,  it 
must  be  because  e  defines  an  edge  of  C  starting  at  its  upper  endpoint  or  at 
its  intersection  point  with  right _edge_C2.  In  both  cases  we  should  add  the 
half-plane  bounding  e  first,  which  is  guaranteed  by  the  order  of  the  tests  given 
above. 
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We  conclude  that  it  takes  constant  time  to  handle  an  edge,  so  the  intersection  of 
two  convex  polygons  can  be  computed  in  time  0(n).  To  show  that  the  algorithm 
is  correct,  we  have  to  prove  that  it  adds  the  half-planes  defining  the  edges  of  C 
in  the  right  order.  Consider  an  edge  of  C,  and  let  p  be  its  upper  endpoint.  Then 
p  is  either  an  upper  endpoint  of  an  edge  in  C\  or  C2,  or  it  is  the  intersection  of 
two  edges  e  and  e'  of  C\  and  C2,  respectively.  In  the  former  case  we  discover  the 
edge  of  C  when  p  is  reached,  and  in  the  latter  case  when  the  lower  of  the  upper 
endpoints  of  e  and  e'  is  reached.  Hence,  all  half-planes  defining  the  edges  of  C 
are  added.  It  is  not  difficult  to  prove  that  they  are  added  in  the  correct  order. 

We  get  the  following  result: 

Theorem  4.3  The  intersection  of  two  convex  polygonal  regions  in  the  plane  can 
be  computed  in  0(n)  time. 

This  theorem  shows  that  we  can  do  the  merge  step  in  IntersectHalf- 
PLANES  in  linear  time.  Hence,  the  recurrence  for  the  running  time  of  the 
algorithm  becomes 


Tin) 


0(1),  if  «  =  1, 

0{n)  +2T{n/2),  if  77  >  1 , 


leading  to  the  following  result: 


Corollary  4.4  The  common  intersection  of  a  set  of  n  half-planes  in  the  plane 
can  be  computed  in  0(u  log/;)  time  and  linear  storage. 

The  problem  of  computing  the  intersection  of  half-planes  is  intimately 
related  to  the  computation  of  convex  hulls,  and  an  alternative  algorithm  can  be 
given  that  is  almost  identical  to  algorithm  ConvexHull  from  Chapter  1.  The 
relationship  between  convex  hulls  and  intersections  of  half-planes  is  discussed 
in  detail  in  Sections  8.2  and  1 1.4.  Those  sections  are  independent  of  the  rest  of 
their  chapters,  so  if  you  are  curious  you  can  already  have  a  look. 


4.3  Incremental  Linear  Programming 

In  the  previous  section  we  showed  how  to  compute  the  intersection  of  a  set  of 
n  half-planes.  In  other  words,  we  computed  all  solutions  to  a  set  of  n  linear 
constraints.  The  running  time  of  our  algorithm  was  0(n  log/;).  One  can  prove 
that  this  is  optimal:  as  for  the  sorting  problem,  any  algorithm  that  solves  the 
half-plane  intersection  problem  must  take  fl(n\ogn)  time  in  the  worst  case.  In 
our  application  to  the  casting  problem,  however,  we  don’t  need  to  know  all 
solutions  to  the  set  of  linear  constraints;  just  one  solution  will  do  fine.  It  turns 
out  that  this  allows  for  a  faster  algorithm. 

Finding  a  solution  to  a  set  of  linear  constraints  is  closely  related  to  a  well- 
known  problem  in  operations  research,  called  linear  optimization  or  linear 
programming.  (This  term  was  coined  before  “programming”  came  to  mean 
“giving  instructions  to  a  computer”.)  The  only  difference  is  that  linear  program¬ 
ming  involves  finding  one  specific  solution  to  the  set  of  constraints,  namely  the 
one  that  maximizes  a  given  linear  function  of  the  variables.  More  precisely,  a 
linear  optimization  problem  is  described  as  follows: 

Maximize  c  i  x\  +  C2X2  H - 1-  cdxd 

Subject  to  apixi-t - I- a \  dxci  ^  b\ 

02,1*1  H - \-ai,dXd  <  b2 

0?^  1*1  T  ■  ■  ■  T  an  dxd  f  bn 

where  the  a,  and  a,j ,  and  b;  are  real  numbers,  which  form  the  input  to  the 
problem.  The  function  to  be  maximized  is  called  the  objective  function,  and 
the  set  of  constraints  together  with  the  objective  function  is  a  linear  program. 
The  number  of  variables,  d,  is  the  dimension  of  the  linear  program.  We  already 
saw  that  linear  constraints  can  be  viewed  as  half-spaces  in  The  intersection 
of  these  half-spaces,  which  is  the  set  of  points  satisfying  all  constraints,  is 
called  the  feasible  region  of  the  linear  program.  Points  (solutions)  in  this  region 
are  called  feasible,  points  outside  are  infeasible.  Recall  from  Figure  4.2  that 
the  feasible  region  can  be  unbounded,  and  that  it  can  be  empty.  In  the  latter 
case,  the  linear  program  is  called  infeasible.  The  objective  function  can  be 
viewed  as  a  direction  in  M.d:  maximizing  cixj  +  C2X2  4 - \-cdxd  means  finding 
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feasible  region 


a  point  (x] , . . .  ,Xd)  that  is  extreme  in  the  direction  c  =  (ci , . . . ,  q).  Hence,  the 
solution  to  the  linear  program  is  a  point  in  the  feasible  region  that  is  extreme 
in  direction  c.  We  let  fp  denote  the  objective  function  defined  by  a  direction 
vector  c. 

Many  problems  in  operations  research  can  be  described  by  linear  programs, 
and  a  lot  of  work  has  been  dedicated  to  linear  optimization.  This  has  resulted  in 
many  different  linear  programming  algorithms,  several  of  which — the  famous 
simplex  algorithm  for  instance — perform  well  in  practice. 

Let’s  go  back  to  our  problem.  We  have  n  linear  constraints  in  two  variables 
and  we  want  to  find  one  solution  to  the  set  of  constraints.  We  can  do  this 
by  taking  an  arbitrary  objective  function,  and  then  solving  the  linear  program 
defined  by  the  objective  function  and  the  linear  constraints.  For  the  latter  step 
we  can  use  the  simplex  algorithm,  or  any  other  linear  programming  algorithm 
developed  in  operations  research.  However,  this  particular  linear  program  is 
quite  different  from  the  ones  usually  studied:  in  operations  research  both  the 
number  of  constraints  and  the  number  of  variables  are  large,  but  in  our  case  the 
number  of  variables  is  only  two.  The  traditional  linear  programming  methods 
are  not  very  efficient  in  such  low-dimensional  linear  programming  problems', 
methods  developed  in  computational  geometry,  like  the  one  described  below, 
do  better. 


We  denote  the  set  of  n  linear  constraints  in  our  2-dimensional  linear  program¬ 
ming  problem  by  H.  The  vector  defining  the  objective  function  is  c  =  ( cx,cy ); 
thus  the  objective  function  is  fp(p)  =  cxpx  +  cypy.  Our  goal  is  to  find  a  point 
p  £  1R2  such  that  p  £  f]H  and  fp{p)  is  maximized.  We  denote  the  linear  program 
by  ( H,c ),  and  we  use  C  to  denote  its  feasible  region.  We  can  distinguish  four 
cases  for  the  solution  of  a  linear  program  ( H,c ).  The  four  cases  are  illustrated 
in  Figure  4.5;  the  vector  defining  the  objective  function  is  vertically  downward 
in  the  examples. 


Figure  4.5 

Different  types  of  solutions  to  a  linear 
program. 
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(i)  The  linear  program  is  infeasible,  that  is,  there  is  no  solution  to  the  set  of 
constraints. 

(ii)  The  feasible  region  is  unbounded  in  direction  c.  In  this  case  there  is  a  ray 
p  completely  contained  in  the  feasible  region  C,  such  that  the  function  /'- 
takes  arbitrarily  large  values  along  p.  The  solution  we  require  in  this  case 
is  the  description  of  such  a  ray. 

(iii)  The  feasible  region  has  an  edge  e  whose  outward  normal  points  in  the 
direction  c.  In  this  case,  there  is  a  solution  to  the  linear  program,  but  it  is 
not  unique:  any  point  on  e  is  a  feasible  point  that  maximizes  fp(p). 

(iv)  If  none  of  the  preceding  three  cases  applies,  then  there  is  a  unique  solution, 
which  is  the  vertex  v  of  C  that  is  extreme  in  the  direction  c. 


Our  algorithm  for  2-dimensional  linear  programming  is  incremental.  It  adds  the  Section  4.3 
constraints  one  by  one,  and  maintains  the  optimal  solution  to  the  intermediate  incremental  linear 
linear  programs.  It  requires,  however,  that  the  solution  to  each  intermediate  programming 
problem  is  well-defined  and  unique.  In  other  words,  it  assumes  that  each 
intermediate  feasible  region  has  a  unique  optimal  vertex  as  in  case  (iv)  above. 

To  fulfill  this  requirement,  we  add  to  our  linear  program  two  additional 
constraints  that  will  guarantee  that  the  linear  program  is  bounded.  For  example, 
if  cx  >  0  and  cy  >  0  we  add  the  contraints  px  sC  M  and  py  sC  M,  for  some 
large  MgR.  The  idea  is  that  M  should  be  chosen  so  large  that  the  additional 
constraints  do  not  influence  the  optimal  solution,  if  the  original  linear  program 
was  bounded. 

In  many  practical  applications  of  linear  programming,  a  bound  of  this  form 
is  actually  a  natural  restriction.  In  our  application  to  the  casting  problem,  for 
instance,  mechanical  limitations  will  not  allow  us  to  remove  the  polyhedron  in 
a  direction  that  is  nearly  horizontal.  For  instance,  we  may  not  be  able  to  remove 
the  polyhedron  in  a  direction  whose  angle  with  the  xy-plane  is  less  than  1  degree. 

This  constraint  immediately  gives  a  bound  on  the  absolute  value  of  px,  py. 

We  will  discuss  in  Section  4.5  how  we  can  correctly  recognize  unbounded 
linear  programs,  and  how  we  can  solve  bounded  ones  without  enforcing  artificial 
constraints  on  the  solution. 

For  preciseness,  let’s  give  a  name  to  the  two  new  constraints: 


and 


f  px  ^  M  if  cx  >  0 
\  ~Px  ^  M  otherwise 

f  py  <  M  if  cy  >  0 
\  ~Py  ^  M  otherwise 


Note  that  mi ,  mi  are  chosen  as  a  function  of  c  only,  they  do  not  depend  on  the 
half-planes  H.  The  feasible  region  Co  =  m\  P  mi  is  an  orthogonal  wedge. 

Another  simple  convention  now  allows  us  to  say  that  case  (iii)  also  has  a 
unique  solution:  if  there  are  several  optimal  points,  then  we  want  the  lexico¬ 
graphically  smallest  one.  Conceptually,  this  convention  is  equivalent  to  rotating 
c  a  little,  such  that  it  is  no  longer  normal  to  any  half-plane. 

We  have  to  be  careful  when  doing  this,  as  even  a  bounded  linear  program 
may  not  have  a  lexicographically  smallest  solution  (see  Exercise  4.11).  Our 
choice  of  the  two  constraints  m\  and  m2  is  such  that  this  cannot  happen. 

With  these  two  conventions,  any  linear  program  that  is  feasible  has  a  unique 
solution,  which  is  a  vertex  of  the  feasible  region.  We  call  this  vertex  the  optimal 
vertex. 


Let  (//, c)  be  a  linear  program.  We  number  the  half-planes  h\,  /12, . . . ,  hn ■  Let 
Hi  be  the  set  of  the  first  i  constraints,  together  with  the  special  constraints  m\ 
and  m2,  and  let  C,  be  the  feasible  region  defined  by  these  constraints: 


Hi  ■=  {mi,m2,hi,h2 ■  ■  ■  ,h{}, 
Cj  :=  m\  fl m2  fl hi  (1  h2  0  •  •  •  fl 
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Chapter  4  By  our  choice  of  Co,  each  feasible  region  C,  has  a  unique  optimal  vertex,  denoted 
linear  programming  V;.  Clearly,  we  have 

Co  2  Ci  D  C2  ■  ■  -  2  Cn  =  C. 

This  implies  that  if  C,  =  0  for  some  ;,  then  Cj  =  0  for  all  j  f  i,  and  the  linear 
program  is  infeasible.  So  our  algorithm  can  stop  once  the  linear  program 
becomes  infeasible. 


The  next  lemma  investigates  how  the  optimal  vertex  changes  when  we  add  a 
half-plane  It  is  the  basis  of  our  algorithm. 

Lemma  4.5  Let  1  ^  i  ^  n,  and  let  Cj  and  v,  be  defined  as  above.  Then  we  have 

( i)  If  Vi- 1  G  hi ,  then  v,  =  v,-_  i . 

(ii)  If  Vi- 1  ^  hj,  then  eitherCi  =  0  orv,-  G  £j,  where  £,■  is  the  line  bounding  hi. 

Proof,  (i)  Let  v,_i  G  b;.  Because  Ci  =  Q-\  C\hi  and  v,-_i  €  C,_ i  this  means  that 
Vi- 1  G  C,.  Furthermore,  the  optimal  point  in  C,  cannot  be  better  than  the 
optimal  point  in  C,_  i,  since  C,  C  C,_  i .  Hence,  v,_i  is  the  optimal  vertex  in 
Ci  as  well. 

(ii)  Let  Vi- 1  f  hi.  Suppose  for  a  contradiction  that  C,  is  not  empty  and  that  v, 
does  not  lie  on  £,.  Consider  the  line  segment  v,-_i  v,-.  We  have  v,_i  G  C,_i 
and,  since  Cj  C  C,_  i,  also  v,  G  C, .  i .  Together  with  the  convexity  of  Ci- 1, 
this  implies  that  the  segment  v,-_  i  v\  is  contained  in  C,_i.  Since  v,_i  is  the 
optimal  point  in  C,-_  i  and  the  objective  function  f-g  is  linear,  it  follows  that 
fc(p)  increases  monotonically  along  v,-_  j  v,-  as  p  moves  from  v(-  to  Now 
consider  the  intersection  point  q  of  v,-_  i  vj  and  £,.  This  intersection  point 
exists,  because  v,-_i  f  hi  and  v,-  G  C,  .  Since  v,-_  i  vj  is  contained  in  C,-_  i,  the 
point  <7  must  be  in  C,-.  But  the  value  of  the  objective  function  increases  along 
v,-_ i  v';,  so  fc(q)  >  fciyi).  This  contradicts  the  definition  of  v,.  0 

Figure  4.6  ilhistrates  the  two  cases  that  arise  when  adding  a  half-plane. 
In  Figure  4.6(i),  the  optimal  vertex  14  that  we  have  after  adding  the  first  four 
half-planes  is  contained  in  h$,  the  next  half-plane  that  we  add.  Therefore  the 
optimal  vertex  remains  the  same.  The  optimal  vertex  is  not  contained  in  h^, 
however,  so  when  we  add  h(,  we  must  find  a  new  optimal  vertex.  According 


Figure  4.6 
Adding  a  half-plane 


to  Lemma  4.5,  this  vertex  ly,  is  contained  in  the  line  bounding  h(„  as  is  shown 
in  Figure  4.6(ii).  But  Lemma  4.5  does  not  tell  us  how  to  find  the  new  optimal 
vertex.  Fortunately,  this  is  not  so  difficult,  as  we  show  next. 


Assume  that  the  current  optimal  vertex  v;_  \  is  not  contained  in  the  next  half-  Section  4.3 

plane  The  problem  we  have  to  solve  can  be  stated  as  follows:  incremental  linear 

PROGRAMMING 

Find  the  point  p  on  £,  that  maximizes  fc(p),  subject  to  the  con¬ 
straints  p  £  h,  for  h  £  Hi-\. 

To  simplify  the  terminology,  we  assume  that  £,  is  not  vertical,  and  so  we  can 
parameterize  it  by  x-coordinate.  We  can  then  define  a  function  /j  :  t  ^  I 
such  that  fc(p)  =  fc(Px)  for  points  p  £  £j.  For  a  half-plane  h ,  let  o(h,£i)  be  the 
x-coordinate  of  the  intersection  point  of  /:,■  and  the  bounding  line  of  h.  (If  there 
is  no  intersection,  then  either  the  constraint  h  is  satisfied  by  any  point  on  £,.  or 
by  no  point  on  £,.  In  the  former  case  we  can  ignore  the  constraint,  in  the  latter 
case  we  can  report  the  linear  program  infeasible.)  Depending  on  whether  £,  n  h 
is  bounded  to  the  left  or  to  the  right,  we  get  a  constraint  on  the  x-coordinate  of 
the  solution  of  the  form  x  ^  <j(hJ:,)  or  of  the  form  x  U  We  can  thus 

restate  our  problem  as  follows: 

Maximize  fc(x) 


subject  to  x  ^  <j(h,  £j) ,  h  £  //,■_ i  and  £j  n  h  is  bounded  to  the  left 
x  ^  <j(h,£j),  h  £  Hi _  i  and  £,■  D  h  is  bounded  to  the  right 

This  is  a  1 -dimensional  linear  program.  Solving  it  is  very  easy.  Let 

xieft  =  max  { cr (h,£j)  :  £j  nil  is  bounded  to  the  left} 

h€Hi_i 


and 

Xrfght  =  min  {<7 (h,£i)  :  £j  D  /;  is  bounded  to  the  right} . 

h€Hj_i 

The  interval  [xieft  :  xnght]  is  the  feasible  region  of  the  1 -dimensional  linear 
program.  Flence,  the  linear  program  is  infeasible  if  xieft  >  xnghi,  and  otherwise 
the  optimal  point  is  the  point  on  ()  at  either  xieft  or  xnght,  depending  on  the 
objective  function. 

Note  that  the  1 -dimensional  linear  program  cannot  be  unbounded,  due  to 
the  constraints  mi  and  m2. 

We  get  the  following  lemma: 

Lemma  4.6  A  1  -dimensional  linear  program  can  be  solved  in  linear  time.  Hence, 
if  case  (ii)  of  Lemma  4.5  arises,  then  we  can  compute  the  new  optimal  vertex  v,-, 
or  decide  that  the  linear  program  is  infeasible,  in  0(i)  time. 


X  <  <7(11,1, 


We  can  now  describe  the  linear  programming  algorithm  in  more  detail.  As 
above,  we  use  £i  to  denote  the  line  that  bounds  the  half-plane  hj. 

Algorithm  2dBoundedLP(//,  c, mi , m2) 

Input.  A  linear  program  (HU  {mi, m2},?),  where  H  is  a  set  of  11  half-planes, 
? £  R2,  and  mi,  m 2  bound  the  solution. 

Output.  If  (HU  {mi, m2}, c)  is  infeasible,  then  this  fact  is  reported.  Otherwise, 
the  lexicographically  smallest  point  p  that  maximizes  f?(p)  is  reported. 
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1 .  Let  i’o  be  the  comer  of  Co- 

2.  Let  h\,. . .  ,hn  be  the  half-planes  of  H. 

3.  for  i  <—  1  to  n 

4.  do  if  V;_i  G  hj 

5.  thenvi<— V;_i 

6.  else  Vi  the  point  p  on  /,  that  maximizes  f?(p),  subject  to  the 

constraints  in  //,_  i. 

7.  if  p  does  not  exist 

8.  then  Report  that  the  linear  program  is  infeasible  and  quit. 

9.  return  v„ 


We  now  analyze  the  performance  of  our  algorithm. 

Lemma  4.7  Algorithm  2dBoundedLP  computes  the  solution  to  a  bounded 
linear  program  with  n  constraints  and  two  variables  in  0(n 2)  time  and  linear 
storage. 

Proof.  To  prove  that  the  algorithm  correctly  finds  the  solution,  we  have  to  show 
that  after  every  stage — whenever  we  have  added  a  new  half-plane  /z(- — the  point 
Vi  is  still  the  optimum  point  for  C,.  This  follows  immediately  from  Lemma  4.5. 
If  the  1 -dimensional  linear  program  on  £,  is  infeasible,  then  C,  is  empty,  and 
consequently  C  =  Cn  C  C,  is  empty,  which  means  that  the  linear  program  is 
infeasible. 

It  is  easy  to  see  that  the  algorithm  requires  only  linear  storage.  We  add  the 
half-planes  one  by  one  in  n  stages.  The  time  spent  in  stage  i  is  dominated  by  the 
time  to  solve  a  1-dimensional  linear  program  in  line  6,  which  is  O(i).  Hence, 
the  total  time  needed  is  bounded  by 


io(i)=0(n*).  □ 

i=l 

Although  our  linear  programming  algorithm  is  nice  and  simple,  its  running 
time  is  disappointing — the  algorithm  is  much  slower  than  the  previous  algorithm, 
which  computed  the  whole  feasible  region.  Is  our  analysis  too  crude?  We 
bounded  the  cost  of  every  stage  i  by  0(i) .  This  is  not  always  a  tight  bound: 
Stage  i  takes  ©(/)  time  only  when  v,_i  f  hg  when  v,_i  G  hi  then  stage  i  takes 
constant  time.  So  if  we  could  bound  the  number  of  times  the  optimal  vertex 
changes,  we  might  be  able  to  prove  a  better  running  time.  Unfortunately  the 
optimum  vertex  can  change  n  times:  there  are  orders  for  some  configurations 
where  every  new  half-plane  makes  the  previous  optimum  illegal.  The  figure  in 
the  margin  shows  such  an  example.  This  means  that  the  algorithm  will  really 
spend  &(n2)  time.  How  can  we  avoid  this  nasty  situation? 


76 


4.4  Randomized  Linear  Programming 

If  we  have  a  second  look  at  the  example  where  the  optimum  changes  n  times, 
we  see  that  the  problem  is  not  so  much  that  the  set  of  half-planes  is  bad.  If  we 


had  added  them  in  the  order  hn-i, . . . ,  h2,  then  the  optimal  vertex  would  not  Section  4.4 

change  anymore  after  the  addition  of  hn.  In  this  case  the  running  time  would  be  randomized  linear 

0(n).  Is  this  a  general  phenomenon?  Is  it  true  that,  for  any  set  H  of  half-planes,  programming 

there  is  a  good  order  to  treat  them?  The  answer  to  this  question  is  “yes,”  but 

that  doesn’t  seem  to  help  us  much.  Even  if  such  a  good  order  exists,  there 

seems  to  be  no  easy  way  to  actually  find  it.  Remember  that  we  have  to  find  the 

order  at  the  beginning  of  the  algorithm,  when  we  don’t  know  anything  about 

the  intersection  of  the  half-planes  yet. 

We  now  meet  a  quite  intriguing  phenomenon.  Although  we  have  no  way  to 
determine  an  ordering  of  H  that  is  guaranteed  to  lead  to  a  good  running  time, 
we  have  a  very  simple  way  out  of  our  problem.  We  simply  pick  a  random 
ordering  of  H.  Of  course,  we  could  have  bad  luck  and  pick  an  order  that  leads 
to  a  quadratic  running  time.  But  with  some  luck,  we  pick  an  order  that  makes  it 
run  much  faster.  Indeed,  we  shall  prove  below  that  most  orders  lead  to  a  fast 
algorithm.  For  completeness,  we  first  repeat  the  algorithm. 

Algorithm  2dR andomizedB  oundedLP(//,  c,  m i ,  m2) 

Input.  A  linear  program  (II LJ  {ni\  ,m2},c),  where  H  is  a  set  of  n  half-planes, 
c  €  R2,  and  m\,  m2  bound  the  solution. 

Output.  If  (HU  {mi,m2},c)  is  infeasible,  then  this  fact  is  reported.  Otherwise, 
the  lexicographically  smallest  point  p  that  maximizes  /'-( p)  is  reported. 

1 .  Let  v'o  be  the  corner  of  Co- 

2.  Compute  a  random  permutation  hi,.. . , hn  of  the  half-planes  by  calling 
RandomPermutation(7/[1  •  ••«]). 

3.  for  i  <—  1  to  n 

4.  do  if  i  e  hj 

5.  then  v,-  <—  v,-_  j 

6.  else  Vi  the  point  p  on  /,  that  maximizes  /'-(/?),  subject  to  the 

constraints  in  Ht-  \ . 

7.  if  p  does  not  exist 

8.  then  Report  that  the  linear  program  is  infeasible  and  quit. 

9.  return  v„ 

The  only  difference  from  the  previous  algorithm  is  in  line  2,  where  we  put  the 
half-planes  in  random  order  before  we  start  adding  them  one  by  one.  To  be  able 
to  do  this,  we  assume  that  we  have  a  random  number  generator.  Random(L), 
which  has  an  integer  k  as  input  and  generates  a  random  integer  between  1  and  k 
in  constant  time.  Computing  a  random  permutation  can  then  be  done  with  the 
following  linear  time  algorithm. 

Algorithm  RandomPermutation(A) 

Input.  An  array  A  [1  •  •  •  n] . 

Output.  The  array  A[l---«]  with  the  same  elements,  but  rearranged  into  a 
random  permutation. 

1 .  for  k  <—  n  downto  2 

2.  do  rndindex  <—  Random(C) 

3.  Exchange  A [k]  and  A[rndindex}. 
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Chapter  4  The  new  linear  programming  algorithm  is  called  a  randomized  algorithm ;  its 
linear  programming  running  time  depends  on  certain  random  choices  made  by  the  algorithm.  (In 
the  linear  programming  algorithm,  these  random  choices  were  made  in  the 
subroutine  RandomPermutation.) 

What  is  the  running  time  of  this  randomized  version  of  our  incremental  linear 
programming  algorithm?  There  is  no  easy  answer  to  that.  It  all  depends  on 
the  order  that  is  computed  in  line  2.  Consider  a  fixed  set  H  of  n  half-planes. 
2DRANDOMIZEDBOUNDEDLP  treats  them  depending  on  the  permutation  cho¬ 
sen  in  line  2.  Since  there  are  n !  possible  permutations  of  n  objects,  there  are 
n !  possible  ways  in  which  the  algorithm  can  proceed,  each  with  its  own  run¬ 
ning  time.  Because  the  permutation  is  random,  each  of  these  running  times 
is  equally  likely.  So  what  we  do  is  analyze  the  expected  running  time  of  the 
algorithm,  which  is  the  average  running  time  over  all  n  \  possible  permutations. 
The  lemma  below  states  that  the  expected  running  time  of  our  randomized  linear 
programming  algorithm  is  0{n).  It  is  important  to  realize  that  we  do  not  make 
any  assumptions  about  the  input:  the  expectancy  is  with  respect  to  the  random 
order  in  which  the  half-planes  are  treated  and  holds  for  any  set  of  half-planes. 

Lemma  4.8  The  2-dimensional  linear  programming  problem  with  n  constraints 
can  be  solved  in  0(n)  randomized  expected  time  using  worst-case  linear  storage. 

Proof.  As  we  observed  before,  the  storage  needed  by  the  algorithm  is  linear. 

The  running  time  RandomPermutation  is  0(n),  so  what  remains  is  to 
analyze  the  time  needed  to  add  the  half-planes  Adding  a  half-plane 

takes  constant  time  when  the  optimal  vertex  does  not  change.  When  the  optimal 
vertex  does  change  we  need  to  solve  a  1 -dimensional  linear  program.  We  now 
bound  the  time  needed  for  all  these  1 -dimensional  linear  programs. 

Let  Xj  be  a  random  variable,  which  is  1  if  v,_i  ^  /z,-,  and  0  otherwise.  Recall 
that  a  1 -dimensional  linear  program  on  i  constraints  can  be  solved  in  O(i)  time. 
The  total  time  spent  in  line  6  over  all  half-planes  h\,...,hn  is  therefore 

E°(  i)‘Xi- 

i—  I 

To  bound  the  expected  value  of  this  sum  we  will  use  linearity  of  expectation',  the 
expected  value  of  a  sum  of  random  variables  is  the  sum  of  the  expected  values 
of  the  random  variables.  This  holds  even  if  the  random  variables  are  dependent. 
Hence,  the  expected  time  for  solving  all  1 -dimensional  linear  programs  is 

e[£  o(o  •*<]=£  o(o -EM. 

;=i  i=i 

But  what  is  E[X,-]?  It  is  exactly  the  probability  that  v;_i  ^  hi.  Let’s  analyze  this 
probability. 

We  will  do  this  with  a  technique  called  backwards  analysis:  we  look  at  the 
algorithm  “backwards.”  Assume  that  it  has  already  finished,  and  that  it  has 
computed  the  optimum  vertex  vn.  Since  v„  is  a  vertex  of  C„,  it  is  defined  by  at 


least  two  of  the  half-planes.  Now  we  make  one  step  backwards  in  time,  and 
look  at  C„_ i-  Note  that  C„-i  is  obtained  from  Cn  by  removing  the  half-plane  hn. 
When  does  the  optimum  point  change?  This  happens  exactly  if  v„  is  not  a  vertex 
of  C„_  i  that  is  extreme  in  the  direction  c,  which  is  only  possible  if  hn  is  one  of 
the  half-planes  that  define  v„.  But  the  half-planes  are  added  in  random  order,  so 
hn  is  a  random  element  of  {hi,h-2,  . . .  ,h„}.  Hence,  the  probability  that  hn  is  one 
of  the  half-planes  defining  v„  is  at  most  2/n.  Why  do  we  say  “at  most”?  First,  it 
is  possible  that  the  boundaries  of  more  than  two  half-planes  pass  through  v„.  In 
that  case,  removing  one  of  the  two  half-planes  containing  the  edges  incident  to 
v„  may  fail  to  change  v„.  Furthermore,  v„  may  be  defined  by  ni\  or  mi,  which 
are  not  among  the  n  candidates  for  the  random  choice  of  h„.  In  both  cases  the 
probability  is  less  than  2/n. 

The  same  argument  works  in  general:  to  bound  E[2Q],  we  fix  the  subset 
of  the  first  i  half-planes.  This  determines  C,-.  To  analyze  what  happened  in 
the  last  step,  when  we  added  hi,  we  think  backwards.  The  probability  that 
we  had  to  compute  a  new  optimal  vertex  when  adding  /i,  is  the  same  as  the 
probability  that  the  optimal  vertex  changes  when  we  remove  a  half-plane  from 
Cj.  The  latter  event  only  takes  place  for  at  most  two  half-planes  of  our  fixed  set 
{h\, . . Since  the  half-planes  are  added  in  random  order,  the  probability 
that  hj  is  one  of  the  special  half-planes  is  at  most  2 / i.  We  derived  this  probability 
under  the  condition  that  the  first ;  half-planes  are  some  fixed  subset  of  H.  But 
since  the  derived  bound  holds  for  any  fixed  subset,  it  holds  unconditionally. 
Hence,  E[2Q]  ^  2 /i.  We  can  now  bound  the  expected  total  time  for  solving  all 
1 -dimensional  linear  programs  by 

£o(«)-y  =  0(n). 

i=i  ' 

We  already  noted  that  the  time  spent  in  the  rest  of  the  algorithm  is  O(n)  as 
well.  0 

Note  again  that  the  expectancy  here  is  solely  with  respect  to  the  random 
choices  made  by  the  algorithm.  We  do  not  average  over  possible  choices  for 
the  input.  For  any  input  set  of  n  half-planes,  the  expected  running  time  of  the 
algorithm  is  0(ii)\  there  are  no  bad  inputs. 


4.5  Unbounded  Linear  Programs 

In  the  preceding  sections  we  avoided  handling  the  case  of  an  unbounded  linear 
program  by  adding  two  additional,  artificial  constraints.  This  is  not  always  a 
suitable  solution.  Even  if  the  linear  program  is  bounded,  we  may  not  know  a 
large  enough  bound  M.  Furthermore,  unbounded  linear  programs  do  occur  in 
practice,  and  we  have  to  solve  them  correctly. 

Let’s  first  see  how  we  can  recognize  whether  a  given  linear  program  ( H,c ) 
is  unbounded.  As  we  saw  before,  that  means  that  there  is  a  ray  p  completely 
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Chapter  4  contained  in  the  feasible  region  C,  such  that  the  function  fy  takes  arbitrarily 
linear  programming  large  values  along  p. 

If  we  denote  the  ray’s  starting  point  as  p,  and  its  direction  vector  as  d,  we 
can  parameterize  p  as  follows: 

p  =  {p  +  Xd  :  A  >0}. 

The  function  f?:  takes  arbitrarily  large  values  if  and  only  if  d  ■  c  >  0.  On  the 
other  hand,  if  rj  (h)  is  the  normal  vector  of  a  half-plane  h  £  II  oriented  towards 
the  feasible  side  of  /z’s  bounding  line,  we  have  d  ■  rj(h)  f  0.  The  next  lemma 
shows  that  these  two  necessary  conditions  on  d  are  sufficient  to  test  whether  a 
linear  program  is  unbounded. 

Lemma  4.9  A  linear  program  (II .  c)  is  unbounded  if  and  only  if  there  is  a  vector 
d  withd-c>0  such  that  d  ■  rj  (h)  f  0  for  all  h  £  II  and  the  linear  program  ( H ' ,  c) 
is  feasible,  where  H'  =  {h  £  H  :  fj  (h)  ■  d  =  0}  . 

Proof.  The  “only  if'  direction  follows  from  the  argument  above,  so  it  remains 
to  show  the  “if"  direction. 

We  consider  a  linear  program  (II .  c)  and  a  vector  d  with  the  conditions  of 
the  lemma.  Since  (H1  ,c)  is  feasible,  there  is  a  point  po  £  ClheH1  h-  Consider  now 
the  ray  po  :=  {po  +  A d  :  A  >  0}.  Since  d-ij(h)  =  0  for  h  £  H' ,  the  ray  po  is 
completely  contained  in  each  h  £  H' .  Furthermore,  since  d  ■  c  >  0  the  objective 
function  takes  arbitrarily  large  values  along  po. 

For  a  half-plane  h  £  H\H',  we  have  d  ■  rj  (h)  >  0.  This  implies  that  there  is 
a  parameter  A/;  such  that  po  +  A d  £  h  for  all  A  ^  A/,.  Let  A'  :=  max(t<:H  //!  A/,, 
and  p  :=  po  +  A ' d.  It  follows  that  the  ray 

p  =  {p  +  Xd  :  A  >  0} 

is  completely  contained  in  each  half-plane  h  £  H,  and  so  (II  .  c)  is  unbounded. 

□ 

We  can  now  test  whether  a  given  2-dimensional  linear  program  ( H,c )  is 
unbounded  by  proceeding  similarly  to  Section  4.1,  and  solving  a  1-dimensional 
linear  program. 

Let’s  first  rotate  the  coordinate  system  so  that  c  is  the  upward  vertical 
direction,  c  =  (0,1).  Any  direction  vector  d  =  (dx,dy)  with  d  -c  >  0  can  be 
normalized  to  the  form  d  =  (dx.  1),  and  be  represented  by  the  point  dx  on  the 
line  y  =  1.  Given  a  normal  vector  rj(Ii)  =  (rjx,  rjy),  the  inequality 

d-  rj  (h)  =  dxTjx  +  rjv  ^  0 

translates  to  the  inequality  dxt)x  f  —7 y.  We  thus  obtain  a  system  of  n  linear 
inequalities,  or,  in  other  words,  a  1 -dimensional  linear  program  H.  (This 
is  actually  an  abuse  of  the  terminology,  since  a  linear  program  consists  of 
constraints  and  an  objective  function.  But  since  at  this  point  we  are  only 
interested  in  feasibility,  it  is  convenient  to  ignore  the  objective  function.) 
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If  H  has  a  feasible  solution  d*,  we  identify  the  set  H'  C  H  of  half-planes  h 
for  which  the  solution  is  tight,  that  is,  where  d*  rfx  +  rjy  =  0.  We  still  need  to 
verify  that  the  system  H'  is  feasible.  Are  we  again  left  with  a  2-dimensional 
linear  programming  problem?  Yes,  but  a  very  special  one:  For  each  h  £  H' 
the  normal  T](/z)  is  orthogonal  to  d  =  (d*,  1),  and  that  means  that  the  bounding 
line  of  h  is  parallel  to  d.  In  other  words,  all  half-planes  in  H'  are  bounded 
by  parallel  lines,  and  by  intersecting  them  with  the  v-axis,  we  have  again  a 
1 -dimensional  linear  program  H' .  If  H'  is  feasible,  then  the  original  linear 
program  is  unbounded,  and  we  can  construct  a  feasible  ray  p  in  time  0(n)  as  in 
the  lemma  above.  If  H'  is  infeasible,  then  so  is  H'  and  therefore  H. 

If  H  does  not  have  a  feasible  solution,  by  the  lemma  above  the  original 
linear  program  ( H ,  c)  is  bounded.  Can  we  extract  some  more  information  in  this 
case?  Recall  the  solution  for  1-dimensional  linear  programs:  H  is  infeasible  if 
and  only  if  the  maximum  boundary  of  a  half-line  h\  bounded  to  the  left  is  larger 
than  the  minimum  boundary  of  a  half-line  I12  bounded  to  the  right.  These  two 
half-lines  h\  and  I12  have  an  empty  intersection.  If  h\  and  ho  are  the  original 
half-planes  that  correspond  to  these  two  constraints,  then  this  is  equivalent  to 
saying  that  {{h\ j/zo},?)  is  bounded.  We  can  call  h\  and  I12  certificates :  they 
‘prove’  that  ( H,c )  is  really  bounded. 

How  useful  certificates  are  becomes  clear  with  the  following  observation: 
After  finding  the  two  certificates  h\  and  /zi,  we  can  use  them  like  mi  and  m2  in 
2dRandomizedBoundedLP.  That  means  that  we  no  longer  need  to  make  an 
artificial  restriction  on  the  range  in  which  we  allow  the  solution  to  lie. 

Again,  we  must  be  careful.  It  can  happen  that  the  linear  program  ({h\  J12} ,c) 
is  bounded,  but  has  no  lexicographically  smallest  solution.  This  is  the  case 
when  the  1 -dimensional  linear  program  is  infeasible  due  to  a  single  constraint 
hi,  namely  when  77 ( /z  1 )  =  —  c=  (0,-1).  In  that  case  we  scan  the  remaining 
list  of  half-planes  for  a  half-plane  I12  with  rpf/z?)  >  0.  If  we  are  successful, 
h\  and  ho  are  certificates  that  guarantee  a  unique  lexicographically  smallest 
solution.  If  no  such  /zi  exists,  the  linear  program  is  either  infeasible,  or  it 
has  no  lexicographically  smallest  solution.  We  can  solve  it  by  solving  the 
1 -dimensional  linear  program  formed  by  all  half-planes  h  with  fj x(h)  =  0.  If  it 
is  feasible,  we  can  return  a  ray  p  in  direction  (—1,0),  such  that  all  points  on  p 
are  feasible  optimal  solutions. 

We  can  now  give  a  general  algorithm  for  the  2-dimensional  linear  program¬ 
ming  problem: 

Algorithm  2dRandomizedLP(//,c) 

Input.  A  linear  program  ( H,c ),  where  H  is  a  set  of  n  half-planes  and  c  £  R2. 
Output.  If  (H,c)  is  unbounded,  a  ray  is  reported.  If  it  is  infeasible,  then  two  or 
three  certificate  half-planes  are  reported.  Otherwise,  the  lexicographically 
smallest  point  p  that  maximizes  ffi p)  is  reported. 

1 .  Determine  whether  there  is  a  direction  vector  d  such  that  d  -c  >  0  and 
d-r\(h)  ^  0  for  all  h  £  H. 

2.  if  z/  exists 

3.  then  compute  H'  and  determine  whether  H'  is  feasible. 

4.  if  H'  is  feasible 
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5. 

6. 

7. 

8. 

9. 

10. 
11. 
12. 

13. 

14. 

15. 

16. 
17. 


then  Report  a  ray  proving  that  (H,  c)  is  unbounded  and  quit, 
else  Report  that  (II .c)  is  infeasible  and  quit. 

Let  h\,li2  G  H  be  certificates  proving  that  (II .c)  is  bounded  and  has  a 
unique  lexicographically  smallest  solution. 

Let  \’2  be  the  intersection  of  i\  and  G- 

Let  h 3,/i4, . . .  ,hn  be  a  random  permutation  of  the  remaining  half-planes  in 
H. 

for  i  <—  3  to  n 
do  if  v,  i  G  hi 

then  Vi <-  V/_i 

else  Vi  4— the  point  p  on  f,  that  maximizes  /?(p),  subject  to  the 
constraints  in  Hi-  j. 
if  p  does  not  exist 

then  Let  hj,hk  (with  j.  k  <  i)  be  the  certificates  (possibly 
hj  =  hk)  with  hj  fl  hk  fl  f,-  =  0. 

Report  that  the  linear  program  is  infeasible,  with 
hj,hj,hk  as  certificates,  and  quit. 

return  v„ 


We  summarize  our  results  so  far  in  the  following  theorem. 

Theorem  4.10  A  2 -dimensional  linear  programming  problem  with  n  constraints 
can  be  solved  in  0(n )  randomized  expected  time  using  worst-case  linear  storage. 


4.6*  Linear  Programming  in  Higher  Dimensions 

The  linear  programming  algorithm  presented  in  the  previous  sections  can  be 
generalized  to  higher  dimensions.  When  the  dimension  is  not  too  high,  then  the 
resulting  algorithm  compares  favorably  with  traditional  algorithms,  such  as  the 
simplex  algorithm. 

Let  H  be  a  set  of  n  closed  half-spaces  in  Given  a  vector  c  =  (ci, . . .  ,Cd),  we 
want  to  find  the  point  p  =  (pi,.. .  ,pd)  G  that  maximizes  the  linear  function 

fc(p)  :=  c\p\  H - \-CdPd,  subject  to  the  constraint  that  p  lies  in  h  for  all  h  G  H. 

To  make  sure  that  the  solution  is  unique  when  the  linear  program  is  bounded, 
we  agree  to  look  for  the  lexicographically  smallest  point  that  maximizes  /?(p). 

As  in  the  planar  version,  we  maintain  the  optimal  solution  while  incremen¬ 
tally  adding  the  half-space  constraints  one  by  one.  For  this  to  work,  we  again 
need  to  make  sure  that  there  is  a  unique  optimal  solution  at  each  step.  We  do 
this  as  in  the  previous  section:  We  first  determine  whether  the  linear  program  is 
unbounded.  If  not,  we  obtain  a  set  of  d  certificates  h\,h2,  ...,hd  &  H  that  guar¬ 
antee  that  the  solution  is  bounded  and  that  there  is  a  unique  lexicographically 
smallest  solution.  We’ll  look  at  the  details  of  finding  these  certificates  later,  and 
concentrate  on  the  main  algorithm  for  the  moment. 

Let  /;  i ,  I12 ,  ■  ■  ■ ,  hc{  be  the  d  certificate  half-spaces  obtained  by  checking  that 
the  linear  program  is  bounded,  and  let  hd+i,  hd+2,  ■  ■  ■ ,  hn  be  a  random  permuta¬ 
tion  of  the  remaining  half-spaces  in  H.  Furthermore,  define  C(-  to  be  the  feasible 
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region  when  the  first  i  half-spaces  have  been  added,  for  d  i  gi  n:  Section  4.6* 

LINEAR  PROGRAMMING  IN  HIGHER 

Q  :=  h\  fl  h2  fl  •  •  •  fl  dimensions 

Let  v,  denote  the  optimal  vertex  of  Ci,  that  is,  the  vertex  that  maximizes  /'-. 

Lemma  4.5  gave  us  an  easy  way  to  maintain  the  optimal  vertex  in  the  2- 
dimensional  case:  either  the  optimal  vertex  doesn’t  change,  or  the  new  optimal 
vertex  is  contained  in  the  line  that  bounds  the  half-plane  h,  that  we  are  adding. 

The  following  lemma  generalizes  this  result  to  higher  dimensions;  its  proof  is  a 
straightforward  generalization  of  the  proof  of  Lemma  4.5. 

Lemma  4.11  Let  1  ^  i  ^  n,  and  let  Ci  and  v,  be  defined  as  above.  Then  we  have 

(i)  If  n- 1  G  hi,  then  v,-  =  v,-i . 

(ii)  If  v,  |  ^  hi,  then  either  Ci  =  0  or  v,-  G  gi,  where  gj  is  the  hyperplane  that 
bounds  hi. 

If  we  denote  the  hyperplane  that  bounds  the  half-space  by  gj,  the  optimal 
vertex  v,  of  C,  can  be  found  by  finding  the  optimal  vertex  of  the  intersection 

gi  n  i . 

But  how  do  we  find  the  optimal  vertex  of  gj  fl  C/_  i  ?  In  two  dimensions  this 
was  easy  to  do  in  linear  time,  because  everything  was  restricted  to  a  line.  Let’s 
look  at  the  3-dimensional  case.  In  three  dimensions,  gj  is  a  plane,  and  gi  fl  C,_i 
is  a  2-dimensional  convex  polygonal  region.  What  do  we  have  to  do  to  find  the 
optimum  in  gi  fl C,_  i  ?  We  have  to  solve  a  2-dimensional  linear  program!  The 
linear  function  fy  defined  in  M3  induces  a  linear  function  in  gt,  and  we  need  to 
find  the  point  in  gi  fl  Q_  i  that  maximizes  this  function.  In  case  c  is  orthogonal  to 
gi,  all  points  on  gj  are  equally  good:  following  our  rule,  we  then  need  to  find  the 
lexicographically  smallest  solution.  We  achieve  this  by  choosing  the  objective 
function  correctly — for  instance,  when  gj  is  not  orthogonal  to  the  x\  -axis,  we 
obtain  the  vector  c  by  projecting  the  vector  (—1,0,0)  onto  gi. 

So  in  the  3-dimensional  case  we  find  the  optimal  vertex  of  gi  fl  C,-_  i  as 
follows:  we  compute  the  intersection  of  all  i  —  1  half-spaces  with  gj,  and  project 
the  vectors 


on  gj  until  a  projection  is  non-zero.  This  results  in  a  linear  program  in  two 
dimensions,  which  we  solve  using  algorithm  2dRandomizedLP. 

By  now  you  can  probably  guess  how  we  will  attack  the  general,  d -dimensional 
case.  There,  gi  is  a  hyperplane,  a  (d  —  l)-dimensional  subspace,  and  we  have 
to  find  the  point  in  the  intersection  Q_ \  fig,-  that  maximizes  /?.  This  is  a  linear 
program  in  d  —  1  dimensions,  and  so  we  will  solve  it  by  making  a  recursive  call 
to  the  ( d  —  1) -dimensional  version  of  our  algorithm.  The  recursion  bottoms  out 
when  we  get  to  a  1 -dimensional  linear  program,  which  can  be  solved  directly  in 
linear  time. 

We  still  need  to  determine  whether  the  linear  program  is  unbounded,  and  to 
find  suitable  certificates  if  that  is  not  the  case.  We  first  verify  that  Lemma  4.9 
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Chapter  4  holds  in  arbitrary  dimensions.  The  lemma  and  its  proof  need  no  change.  The 
linear  programming  lemma  implies  that  the  ^-dimensional  linear  program  ( H,c )  is  bounded  if  and 
only  if  a  certain  ( d  —  1 ) -dimensional  linear  program  is  infeasible.  We  will  solve 
this  ( d  —  1) -dimensional  linear  program  by  a  recursive  call. 

If  the  ( d  —  1) -dimensional  linear  program  is  feasible,  we  obtain  a  direction 
vector  d.  The  <i-dimensional  linear  program  is  then  either  unbounded  in  di¬ 
rection  d,  or  infeasible.  This  can  be  determined  by  verifying  whether  (//' ,c) 
is  feasible,  where  H'  is  as  defined  in  Lemma  4.9.  The  boundaries  of  all  the 
half-spaces  in  H'  are  parallel  to  d,  and  so  this  can  be  decided  by  solving  a 
second  (d  —  1) -dimensional  program,  with  a  second  recursive  call. 

If  the  (d  —  1) -dimensional  linear  program  is  infeasible,  its  solution  will  give 
us  k  certificate  half-spaces  h\ , I12,  ■ . . , %  G  H,  with  k  <d,  that  ‘prove’  that  (H . c) 
is  bounded.  If  k  <  d,  then  the  set  of  optimal  solutions  to  ({hi,. . . . hi } . c)  is 
unbounded.  In  that  case,  these  optimal  solutions  form  a  (d  —  k) -dimensional 
subspace.  We  determine  whether  the  linear  program  restricted  to  this  subspace 
is  bounded  with  respect  to  the  lexicographical  order.  If  not,  we  can  report 
the  solution,  otherwise  we  can  repeat  the  process  until  we  obtain  a  set  of 
d  certificates  with  a  unique  solution. 

The  global  algorithm  now  looks  as  follows.  Again  we  use  g;  to  denote  the 
hyperplane  that  bounds  the  half-space 

Algorithm  RandomizedLP(7/,c) 

Input.  A  linear  program  ( H,c ),  where  H  is  a  set  of  n  half-spaces  in  M.d  and 

cG  Rd. 

Output.  If  ( H,c )  is  unbounded,  a  ray  is  reported.  If  it  is  infeasible,  then  at  most 
d  +  1  certificate  half-planes  are  reported.  Otherwise,  the  lexicographically 
smallest  point  p  that  maximizes  fz(p)  is  reported. 

1 .  Determine  whether  a  direction  vector  d  exists  such  that  d  c  >  0  and 
d-rj(h)^Q  for  all  h  G  H. 

2.  if  d  exists 

3.  then  compute  H'  and  determine  whether  H'  is  feasible. 

4.  if  H'  is  feasible 

5.  then  Report  a  ray  proving  that  ( H,c )  is  unbounded  and  quit. 

6.  else  Report  that  (H,c)  is  infeasible,  provide  certificates,  and 

quit. 

7.  Let  hi,h2,--.,hd  be  certificates  proving  that  (H, c )  is  bounded. 

8.  Let  v,i  be  the  intersection  of  gi  ,g2,  ■■  -,gd- 

9.  Compute  a  random  permutation  h^+i , . . . ,  h„  of  the  remaining  half-spaces 
in  H. 

10.  for  i  <—  d+  1  to  n 

11.  do  if  v,_i  G  hi 

12.  then  v,  <—  v,-_i 

13.  else  v',-  <—  the  point  p  on  g,  that  maximizes  f?(p),  subject  to  the 

constraints  {hi,...  } 

14.  if  p  does  not  exist 

15.  then  Let  H*  be  the  at  most  d  certificates  for  the  infeasi¬ 

bility  of  the  (d  —  1) -dimensional  program. 
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16. 


Report  that  the  linear  program  is  infeasible,  with  Section  4.6* 

H*  U  hj  as  certificates,  and  quit.  linear  programming  in  higher 

17.  return  v„  dimensions 

The  following  theorem  states  the  performance  of  RandomizedLP.  Although 
we  consider  d  a  constant,  which  means  we  can  state  an  0(n)  bound  on  the 
running  time,  it  is  useful  to  have  a  close  look  at  the  dependency  of  the  running 
time  on  d — see  the  end  of  the  proof  the  following  theorem. 

Theorem  4.12  For  each  fixed  dimension  d,  ad -dimensional  linear  programming 
problem  with  n  constraints  can  be  solved  in  0(n)  expected  time. 

Proof.  We  must  prove  that  there  is  a  constant  Cj  such  that  the  algorithm  takes 
at  most  Cdn  expected  time.  We  proceed  by  induction  on  the  dimension  d.  For 
two  dimensions,  the  result  follows  from  Theorem  4.10,  so  let’s  assume  d  >2. 

The  induction  step  is  basically  identical  to  the  proof  of  the  2-dimensional  cases. 

We  start  by  solving  at  most  d  linear  programs  of  dimension  d  -  1 .  By  the 
induction  assumption,  this  takes  time  0(dn)  +  dCd-\n. 

The  algorithm  spends  0(d)  time  to  compute  iy/ .  Testing  whether  v,_i  €  h, 
takes  0(d)  time.  The  running  time  is  therefore  0(dn)  as  long  as  we  do  not 
count  the  time  spent  in  line  13. 

In  line  13,  we  need  to  project  c  on  gt,  in  time  0(d),  and  to  intersect  i 
half-spaces  with  gi,  in  time  0(di).  Furthermore,  we  make  a  recursive  call  with 
dimension  d  —  1  and  i  —  1  half-spaces. 

Define  a  random  variable  X ),  which  is  1  if  v,_i  f  hi,  and  0  otherwise.  The 
total  expected  time  spent  by  the  algorithm  is  bounded  by 

n 

0(dn)+dCd-in+  ^  (0(di)  +  Cd-\(i  —  1)) -E[Xj], 
i=d+ 1 


To  bound  FjAj],  we  apply  backwards  analysis.  Consider  the  situation  after 
adding  hi,...,  hj.  The  optimum  point  is  a  vertex  v/  of  Q,  so  it  is  defined  by  d  of 
the  half-spaces.  Now  we  make  one  step  backwards  in  time.  The  optimum  point 
changes  only  if  we  remove  one  of  the  half-spaces  defining  v,-.  Since  hd+ 1,  ■  ■  ■ ,  hj 
is  a  random  permutation,  the  probability  that  this  happens  is  at  most  d/(i  —  d). 

Consequently,  we  get  the  following  bound  for  the  expected  running  time  of 
the  algorithm: 


«  d 

0(dn)+dCd-in+  ^  (0(di)  +  Cd-\(i-  1))t— , 

i=d+ 1  l~  d 

This  can  be  bounded  by  Cdn,  with  Q  =  0(Cd~\d),  so  C,i  =  0(cdd\)  for  a 
constant  c  indpendent  on  the  dimension.  0 

When  d  is  a  constant,  it  is  correct  to  say  that  the  algorithm  runs  in  linear 
time.  Still,  that  would  be  quite  misleading.  The  constant  factor  Q  grows  so  fast 
as  a  function  of  d  that  this  algorithm  is  useful  only  for  rather  small  dimensions. 
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Chapter 4  4.7*  Smallest  Enclosing  Discs 

LINEAR  PROGRAMMING 

The  simple  randomized  technique  we  used  above  turns  out  to  be  surprisingly 
powerful.  It  can  be  applied  not  only  to  linear  programming  but  to  a  variety  of 
other  optimization  problems  as  well.  In  this  section  we  shall  look  at  one  such 
problem. 

Consider  a  robot  arm  whose  base  is  fixed  to  the  work  floor.  The  arm  has  to  pick 
up  items  at  various  points  and  place  them  at  other  points.  What  would  be  a  good 
position  for  the  base  of  the  arm?  This  would  be  somewhere  “in  the  middle”  of 
the  points  it  must  be  able  to  reach.  More  precisely,  a  good  position  is  at  the 
center  of  the  smallest  disc  that  encloses  all  the  points.  This  point  minimizes  the 
maximum  distance  between  the  base  of  the  arm  and  any  point  it  has  to  reach. 
We  arrive  at  the  following  problem:  given  a  set  P  of  n  points  in  the  plane  (the 
points  on  the  work  floor  that  the  arm  must  be  able  to  reach),  find  the  smallest 
enclosing  disc  for  P,  that  is,  the  smallest  disc  that  contains  all  the  points  of  P. 
This  smallest  enclosing  disc  is  unique — see  Lemma  4. 14(i)  below,  which  is  a 
generalization  of  this  statement. 
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As  in  the  previous  sections,  we  will  give  a  randomized  incremental  algorithm  for 
the  problem:  First  we  generate  a  random  permutation  p\, . . .  ,pn  of  the  points  in 
P.  Let  Pj  :  =  {p  \ ... .  ,pi}.  We  add  the  points  one  by  one  while  we  maintain  Dj, 
the  smallest  enclosing  disc  of  Pj. 

In  the  case  of  linear  programming,  there  was  a  nice  fact  that  helped  us  to 
maintain  the  optimal  vertex:  when  the  current  optimal  vertex  is  contained  in  the 
next  half-plane  then  it  does  not  change,  and  otherwise  the  new  optimal  vertex 
lies  on  the  boundary  of  the  half-plane.  Is  a  similar  statement  true  for  smallest 
enclosing  discs?  The  answer  is  yes: 

Lemma  4.13  Let  2  <  i  <  n,  and  let  Pj  and  Dj  be  defined  as  above.  Then  we 
have 

(i)  If  pi  G  Dj  i ,  then  Dj  =  Dj- 1 . 

(ii)  If  pi  (jl  D,  i ,  then  pj  lies  on  the  boundary  ofDj. 

We  shall  prove  this  lemma  later,  after  we  have  seen  how  we  can  use  it  to 
design  a  randomized  incremental  algorithm  that  is  quite  similar  to  the  linear 
programming  algorithm. 

Algorithm  MlNlDlSC(P) 

Input.  A  set  P  of  n  points  in  the  plane. 

Output.  The  smallest  enclosing  disc  for  P. 

1 .  Compute  a  random  permutation  p\ , . . .  ,p„  of  P. 

2.  Let  Di  be  the  smallest  enclosing  disc  for  {p\1p2\. 

3.  for  i  <—  3  to  n 

4.  do  if  pi  G  I),-  i 

5.  then  D,  <— i 

6.  else  Dj  <—  MiniDiscWithPoint({pi,  . . .  ,pi-i},pd 

7.  return  D„ 


The  critical  step  occurs  when  /;,■  (/_  D,_  i .  We  need  a  subroutine  that  finds  the 
smallest  disc  enclosing  P,,  using  the  knowledge  that  p,  must  lie  on  the  boundary 
of  that  disc.  How  do  we  implement  this  routine?  Let  q  :=  We  use  the  same 
framework  once  more:  we  add  the  points  of  1)  _  |  in  random  order,  and  maintain 
the  smallest  enclosing  disc  of  Pj_  i  U  { q }  under  the  extra  constraint  that  it  should 
have  q  on  its  boundary.  The  addition  of  a  point  pj  will  be  facilitated  by  the 
following  fact:  when  pj  is  contained  in  the  currently  smallest  enclosing  disc 
then  this  disc  remains  the  same,  and  otherwise  it  must  have  pj  on  its  boundary. 
So  in  the  latter  case,  the  disc  has  both  q  and  pj  and  its  boundary.  We  get  the 
following  subroutine. 

MiniDisc  WithPointCP,^) 

Input.  A  set  P  of  n  points  in  the  plane,  and  a  point  q  such  that  there  exists  an 
enclosing  disc  for  P  with  q  on  its  boundary. 

Output.  The  smallest  enclosing  disc  for  P  with  q  on  its  boundary. 

1 .  Compute  a  random  permutation  p\,...,pn  of  P. 

2.  Let  D\  be  the  smallest  disc  with  q  and  p\  on  its  boundary. 

3.  for  j  <—  2  to  n 

4.  do  if  pj  G  Dj- 1 

5.  then  D,  <— Z),_i 

6.  else  Dj  <—  MlNlDlSCWlTH2PoiNTS({pi, . .  .,pj-i},pj,q) 

7.  return  Dn 


How  do  we  find  the  smallest  enclosing  disc  for  a  set  under  the  restriction  that 
two  given  points  q\  and  q2  are  on  its  boundary?  We  simply  apply  the  same 
approach  one  more  time.  Thus  we  add  the  points  in  random  order  and  maintain 
the  optimal  disc;  when  the  point  pk  we  add  is  inside  the  current  disc  we  don’t 
have  to  do  anything,  and  when  p k  is  not  inside  the  current  disc  it  must  be  on 
the  boundary  of  the  new  disc.  In  the  latter  case  we  have  three  points  on  the  disc 
boundary:  q\,  c/i,  and  pk.  This  means  there  is  only  one  disc  left:  the  unique 
disc  with  q\,  q2,  and  pk  on  its  boundary.  This  following  routine  describes  this 
in  more  detail. 

MiniDisc  With2Points(.P,  <?i ,  42) 

Input.  A  set  P  of  n  points  in  the  plane,  and  two  points  q\  and  q2  such  that  there 
exists  an  enclosing  disc  for  P  with  q\  and  qi  on  its  boundary. 

Output.  The  smallest  enclosing  disc  for  P  with  q\  and  qi  on  its  boundary. 

1.  Let  Do  be  the  smallest  disc  with  q\  and  q2  on  its  boundary. 


2. 

for  k  <—  1  to  n 

3. 

do  if  pk  G  Dk 

l 

4. 

then  Dk 

At- i 

5. 

else  Dk 

<— the  disc  with  q\,  i/i,  and  pk  on  its  boundary 

6. 

return  D„ 

Section  4.7* 

SMALLEST  ENCLOSING  DISCS 


This  finally  completes  the  algorithm  for  computing  the  smallest  enclosing  disc 
of  a  set  of  points.  Before  we  analyze  it,  we  must  validate  its  correctness  by 
proving  some  facts  that  we  used  in  the  algorithms.  For  instance,  we  used  the 
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Chapter  4  fact  that  when  we  added  a  new  point  and  this  point  was  outside  the  current 
linear  programming  optimal  disc,  then  the  new  optimal  disc  must  have  this  point  on  its  boundary. 

Lemma  4.14  Let  P  be  a  set  of  points  in  the  plane,  let  R  be  a  possibly  empty  set 
of  points  with  P  HR  =  0,  and  let  p  £  P.  Then  the  following  holds: 

(i)  If  there  is  a  disc  that  encloses  P  and  has  all  points  of  R  on  its  boundary, 
then  the  smallest  such  disc  is  unique.  We  denote  it  by  md (P.  R) . 

(ii)  If  p  £  md(P\{p},R),  then  md(P,R)  =  md(P\{p},R). 

(iii)  If  p  ^  md(P\{p},R),  then  md(P,R)  =  md(P\{p},RU{p}). 

Proof,  (i)  Assume  that  there  are  two  distinct  enclosing  discs  Dq  and  D\  with 
centers  xo  and  xi,  respectively,  and  with  the  same  radius.  Clearly,  all  points 
of  P  must  lie  in  the  intersection  Do  C  D\.  We  define  a  continuous  family 
{ DiX)  |  0  ^  X  ^  1}  of  discs  as  follows.  Let  z  be  an  intersection  point  of 
3Dq  and  dD\,  the  boundaries  of  Dq  and  D\.  The  center  of  DiX)  is  the 
point  x(A)  :=  (1  —  A)xo  +  Axi,  and  the  radius  of  D(X)  is  r(X)  :=  d(x(X),z). 
We  have  Do  P  I)  \  C  DiX)  for  all  X  with  0  ^  X  ^  1  and.  in  particular,  for 
X  =  1/2.  Hence,  since  both  Dq  and  D\  enclose  all  points  of  P,  so  must 
Z)(  1/2).  Moreover,  dD(  1/2)  passes  through  the  intersection  points  of  d Do 
and  dD\ .  Because  R  C  dDo  ft  dD\,  this  implies  that  R  C  dD(  1/2)  .  In  other 
words,  D(  1  / 2)  is  an  enclosing  disc  for  P  with  R  on  its  boundary.  But  the 
radius  of  D(  1/2)  is  strictly  less  than  the  radii  of  Dq  and  D\.  So  whenever 
there  are  two  distinct  enclosing  discs  of  the  same  radius  with  R  on  their 
boundary,  then  there  is  a  smaller  enclosing  disc  with  R  on  its  boundary. 
Hence,  the  smallest  enclosing  disc  md(P,R )  is  unique. 

(ii)  Let  D  :=  md(P\  {p},/?).  If  p  €  D,  then  D  contains  P  and  has  R  on  its 
boundary.  There  cannot  be  any  smaller  disc  containing  P  with  R  on  its 
boundary,  because  such  a  disc  would  also  be  a  containing  disc  for  P  \  {/;  } 
with  R  on  its  boundary,  contradicting  the  definition  of  D.  It  follows  that 
D  =  md(P,R). 

(iii)  Let  Dq  :=  md(P  \  {p},R)  and  let  D\  :=  md(P.R).  Consider  the  family 

D(X)  of  discs  defined  above.  Note  that  Z)( 0)  =  Dq  and  Z)(l)  =  D\,  so  the 
family  defines  a  continous  deformation  of  Dq  to  D\.  By  assumption  we 
have  p  f  Dq.  We  also  have  p  £  D i,  so  by  continuity  there  must  be  some 
0  <  X*  ^  1  such  that  p  lies  on  the  boundary  of  DiX*).  As  in  the  proof  of 
(i),  we  have  P  C  D(X*)  and  R  C  dD(X*).  Since  the  radius  of  any  D(X)  with 
0  <  X  <  1  is  strictly  less  than  the  radius  of  D\ ,  and  D\  is  by  definition  the 
smallest  enclosing  disc  for  P,  we  must  have  X*  =  1.  In  other  words,  D\  has 
p  on  its  boundary.  ED 

Lemma  4.14  implies  that  MlNlDlSC  correctly  computes  the  smallest  enclos¬ 
ing  disc  of  a  set  of  points.  The  analysis  of  the  running  time  is  given  in  the  proof 
of  the  following  theorem. 

Theorem  4.15  The  smallest  enclosing  disc  for  a  set  of  n  points  in  the  plane  can 
be  computed  in  0(n)  expected  time  using  worst-case  linear  storage. 
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Proof.  MlNlDlSC With2Points  runs  in  0{n)  time  because  every  iteration  of 
the  loop  takes  constant  time,  and  it  uses  linear  storage.  MiniDiscWithPoint 


and  MlNlDlSC  also  need  linear  storage,  so  what  remains  is  to  analyze  their 
expected  running  time. 

The  running  time  of  MiniDiscWithPoint  is  0(n)  as  long  as  we  don’t 
count  the  time  spent  in  calls  to  MiniDiscWith2Points.  What  is  the  prob¬ 
ability  of  having  to  make  such  a  call?  Again  we  use  backwards  analysis  to 
bound  this  probability:  Fix  a  subset  {p\ ,/?;},  and  let  Z),  be  the  smallest  disc 
enclosing  {p\,. . . , /?, }  and  having  q  on  its  boundary.  Imagine  that  we  remove 
one  of  the  points  {pi , . . .  ,p,}.  When  does  the  smallest  enclosing  circle  change? 
That  happens  only  when  we  remove  one  of  the  three  points  on  the  boundary. 
One  of  the  points  on  the  boundary  is  q ,  so  there  are  at  most  two  points  that  cause 
the  smallest  enclosing  circle  to  shrink.  The  probability  that  p\  is  one  of  those 
points  is  2/ i.  (When  there  are  more  than  three  points  on  the  boundary,  then  the 
probability  that  the  smallest  enclosing  circle  changes  can  only  get  smaller.)  So 
we  can  bound  the  total  expected  running  time  of  MiniDiscWithPoint  by 

<900 +  £  0(0  7  =  0(n). 

i=2  ' 

Applying  the  same  argument  once  more,  we  find  that  the  expected  running  time 
of  MlNlDlSC  is  0(ri)  as  well.  0 

Algorithm  MlNlDlSC  can  be  improved  in  various  ways.  First  of  all,  it  is  not 
necessary  to  use  a  fresh  random  permutation  in  every  instance  of  subroutine 
MiniDiscWithPoint.  Instead,  one  can  compute  a  permutation  once,  at 
the  start  of  MiniDisc,  and  pass  the  permutation  to  MiniDiscWithPoint. 
Furthermore,  instead  of  writing  three  different  routines,  one  could  write  a  single 
algorithm  MlNlDlSCWrmPoiNTSf/Z  R)  that  computes  md(P.  R)  as  defined  in 
Lemma  4.14. 


4.8  Notes  and  Comments 

In  this  chapter  we  have  studied  an  algorithmic  problem  that  arises  when  one 
wants  to  manufacture  an  object  using  casting.  Other  manufacturing  processes 
lead  to  challenging  algorithmic  problems  as  well,  and  a  number  of  such  problems 
have  been  studied  within  computational  geometry  over  the  past  years — see  for 
example  the  book  by  Dutta  et  al.  [152]  or  the  surveys  by  Janardan  and  Woo  [220] 
and  Bose  and  Toussaint  [72], 

The  computation  of  the  common  intersection  of  half-planes  is  an  old  and 
well-studied  problem.  As  we  will  explain  in  Chapter  1 1,  the  problem  is  dual  to 
the  computation  of  the  convex  hull  of  points  in  the  plane.  Both  problems  have  a 
long  history  in  the  field,  and  Preparata  and  Shamos  [323]  already  list  a  number 
of  solutions.  More  information  on  the  computation  of  2-dimensional  convex 
hulls  can  be  found  in  the  notes  and  comments  of  Chapter  1. 

Computing  the  common  intersection  of  half-spaces,  which  can  be  done 
in  O(n\ogn)  time  in  the  plane  and  in  3-dimensional  space,  becomes  a  more 
computationally  demanding  problem  when  the  dimension  increases.  The  reason 
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Chapter  4  is  that  the  number  of  (lower-dimensional)  faces  of  the  convex  polytope  formed 
linear  programming  as  the  common  intersection  can  be  as  large  as  @(«Lrf/2J)  [158].  So  if  the  only 
goal  is  to  find  a  feasible  point,  computing  the  common  intersection  explicitly 
soon  becomes  an  unattractive  approach. 

Linear  programming  is  one  of  the  basic  problems  in  numerical  analysis  and 
combinatorial  optimization.  It  goes  beyond  the  scope  of  this  chapter  to  survey 
this  literature,  and  we  restrict  ourselves  to  mentioning  the  simplex  algorithm 
and  its  variants  [139],  and  the  polynomial-time  solutions  of  Khachiyan  [234] 
and  Karmarkar  [227].  More  information  on  linear  programming  can  be  found 
in  books  by  Chvatal  [129]  and  Schrijver  [339]. 

Linear  programming  as  a  problem  in  computational  geometry  was  first 
considered  by  Megiddo  [273],  who  showed  that  the  problem  of  testing  whether 
the  intersection  of  half-spaces  is  empty  is  strictly  simpler  than  the  computa¬ 
tion  of  the  intersection.  He  gave  the  first  deterministic  algorithm  for  linear 
programming  whose  running  time  is  of  the  form  OiCjn),  where  C,j  is  a  factor 
depending  on  the  dimension  only.  His  algorithm  is  linear  in  n  for  any  fixed 
dimension.  The  factor  Q  in  his  algorithm  is  22  .  This  was  later  improved 
to  yh  [130,  153].  More  recently,  a  number  of  simpler  and  more  practical  ran¬ 
domized  algorithms  have  been  given  [132,  346,  354].  There  are  a  number  of 
randomized  algorithms  whose  running  time  is  subexponential,  but  still  not  poly¬ 
nomial  in  the  dimension  [222,  267].  Finding  a  strongly  polynomial  algorithm, 
that  is  of  combinatorial  polynomial  complexity,  for  linear  programming  is  one 
of  the  major  open  problems  in  the  area. 

The  simple  randomized  incremental  algorithm  for  two  and  higher  dimen¬ 
sions  given  here  is  due  to  Seidel  [346].  Unlike  in  our  presentation,  he  deals 
with  unbounded  linear  programs  by  treating  the  parameter  M  symbolically.  This 
is  probably  more  elegant  and  efficient  than  the  algorithm  we  present,  which 
was  chosen  to  demonstrate  the  relationship  between  unbounded  cZ-dimensional 
linear  programs  and  feasible  (d—  1) -dimensional  ones.  In  Seidel’s  version,  the 
factor  Cj  can  be  shown  to  be  0(d\). 

The  generalization  to  the  computation  of  smallest  enclosing  discs  is  due  to 
Welzl  [385],  who  also  showed  how  to  find  the  smallest  enclosing  ball  of  a  set 
of  points  in  higher  dimensions,  and  the  smallest  enclosing  ellipse  or  ellipsoid. 
Sharir  and  Welzl  further  generalized  the  technique  and  introduced  the  notion 
of  LP-type  problems,  which  can  be  solved  efficiently  with  an  algorithm  similar 
to  the  ones  described  here  [189,  354].  Generally  speaking,  the  technique  is 
applicable  to  optimization  problems  where  the  solution  either  does  not  change 
when  a  new  constraint  is  added,  or  the  solution  is  partially  defined  by  the  new 
constraint  so  that  the  dimension  of  the  problem  is  reduced.  It  has  also  been 
shown  that  the  special  properties  of  LP-type  problems  give  rise  to  so-called 
Helly-type  theorems  [16]. 

Randomization  is  a  technique  that  often  produces  algorithms  that  are  simple  and 
efficient.  We  will  see  more  examples  in  the  following  chapters.  The  price  we 
pay  is  that  the  running  time  is  only  an  expected  bound  and — as  we  observed — 
there  is  a  certain  chance  that  the  algorithm  takes  much  longer.  Some  people 
take  this  as  a  reason  to  say  that  randomized  algorithms  cannot  be  trusted  and 
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shouldn’t  be  used  (think  of  a  computer  in  an  intensive  care  station  in  a  hospital, 
or  in  a  nuclear  power  plant). 

On  the  other  hand,  deterministic  algorithms  are  only  perfect  in  theory.  In 
practice,  any  non-trivial  algorithm  may  contain  bugs,  and  even  if  we  neglect 
this,  there  is  the  risk  of  hardware  malfunction  or  “soft  errors”:  single  bits  in 
core  memory  flipping  under  the  influence  of  ambient  a-radiation.  Because 
randomized  algorithms  are  often  much  simpler  and  have  shorter  code,  the 
probability  of  such  a  mishap  is  smaller.  Therefore  the  total  probability  that  a 
randomized  algorithm  fails  to  produce  the  correct  answer  in  time  need  not  be 
larger  than  the  probability  that  a  deterministic  algorithm  fails.  Moreover,  we 
can  always  reduce  the  probability  that  the  actual  running  time  of  a  randomized 
algorithm  exceeds  its  expected  running  time  by  allowing  a  larger  constant  in  the 
expected  running  time. 


4.9  Exercises 

4.1  In  this  chapter  we  studied  the  casting  problem  for  molds  of  one  piece.  A 
sphere  cannot  be  manufactured  in  this  manner,  but  it  can  be  manufactured 
if  we  use  a  two-piece  mold.  Give  an  example  of  an  object  that  cannot  be 
manufactured  with  a  two-piece  mold,  but  that  can  be  manufactured  with 
a  three-piece  mold. 

4.2  Consider  the  casting  problem  in  the  plane:  we  are  given  polygon  IP  and  a 
2-dimensional  mold  for  it.  Describe  a  linear  time  algorithm  that  decides 
whether  IP  can  be  removed  from  the  mold  by  a  single  translation. 

4.3  Suppose  that,  in  the  3-dimensional  casting  problem,  we  do  not  want  the 
object  to  slide  along  a  facet  of  the  mold  when  we  remove  it.  How  does 
this  affect  the  geometric  problem  (computing  a  point  in  the  intersection 
of  half-planes)  that  we  derived? 

4.4  Let  IP  be  a  castable  simple  polyhedron  with  top  facet  /.  Let  d  be  a  removal 
direction  for  IP.  Show  that  any  line  with  direction  d  intersects  7  if  and 
only  if  it  intersects  /.  Also  show  that  for  any  line  t  with  direction  d,  the 
intersection  £  IT  CP  is  connected. 

4.5  Let  IP  be  a  simple  polyhedron  with  n  vertices.  If  7  is  castable  with  some 
facet  /  as  top  facet,  then  a  necessary  condition  is  that  the  facets  adjacent 
to  /  must  lie  completely  to  one  side  of  hf,  the  plane  through  /.  (The 
reverse  is  not  necessarily  true,  of  course:  if  all  adjacent  facets  lie  to  one 
side  of  h  r  then  7  is  not  necessarily  castable  with  /  as  top  facet.)  Give  a 
linear  time  algorithm  to  compute  all  facets  of  7  for  which  this  condition 
holds. 

4.6*  Consider  the  restricted  version  of  the  casting  problem  in  which  we  insist 
that  the  object  is  removed  from  its  mold  using  a  vertical  translation 
(perpendicular  to  the  top  facet). 
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a.  Prove  that  in  this  case  there  is  always  only  a  constant  number  of 
possible  top  facets. 

b.  Give  a  linear  time  algorithm  that  determines  whether  for  a  given  object 
a  mold  exists  under  this  restricted  model. 
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4.7  Instead  of  removing  the  object  from  the  mold  by  a  single  translation,  we 
can  also  try  to  remove  it  by  a  single  rotation.  For  simplicity,  let’s  consider 
the  planar  variant  of  this  version  of  the  casting  problem,  and  let’s  only 
look  at  clockwise  rotations. 

a.  Give  an  example  of  a  simple  polygon  T  with  top  facet  /  that  is  not 
castable  when  we  require  that  7  should  be  removed  from  the  mold  by 
a  single  translation,  but  that  is  castable  using  rotation  around  a  point. 
Also  give  an  example  of  a  simple  polygon  IP  with  top  facet  /  that  is 
not  castable  when  we  require  that  7  should  be  removed  from  the  mold 
by  a  rotation,  but  that  is  castable  using  a  single  translation. 

b.  Show  that  the  problem  of  finding  a  center  of  rotation  that  allows  us 
to  remove  7  with  a  single  rotation  from  its  mold  can  be  reduced  to 
the  problem  of  finding  a  point  in  the  common  intersection  of  a  set  of 
half-planes. 

4.8  The  plane  z  =  1  can  be  used  to  represent  all  directions  of  vectors  in  3- 
dimensional  space  that  have  a  positive  z- value.  How  can  we  represent 
all  directions  of  vectors  in  3-dimensional  space  that  have  a  non-negative 
7- value?  And  how  can  we  represent  the  directions  of  all  vectors  in  3- 
dimensional  space? 

4.9  Suppose  we  want  to  find  all  optimal  solutions  to  a  3-dimensional  linear 
program  with  n  constraints.  Argue  that  Cl(n\ogn)  is  a  lower  bound  for 
the  worst-case  time  complexity  of  any  algorithm  solving  this  problem. 

4.10  Let  H  be  a  set  of  at  least  three  half-planes  with  a  non-empty  intersection 
such  that  not  all  bounding  lines  are  parallel.  We  call  a  half-plane  h  G  // 
redundant  if  it  does  not  contribute  an  edge  to  fj  H.  Prove  that  for  any 
redundant  half-plane  h  £  H  there  are  two  half-planes  h! ,  h"  £  H  such  that 
hrC\h"  C  h.  Give  an  0(n  log/;)  time  algorithm  to  compute  all  redundant 
half-planes. 

4.1 1  Give  an  example  of  a  2-dimensional  linear  program  that  is  bounded,  but 
where  there  is  no  lexicographically  smallest  solution. 

4.12  Prove  that  RandomPermutation(A)  is  correct,  that  is,  prove  that  every 
possible  permutation  of  A  is  equally  likely  to  be  the  output.  Also  show  that 
the  algorithm  is  no  longer  correct  (it  no  longer  produces  every  permutation 
with  equal  probability)  if  we  change  the  k  in  line  2  to  n. 

4.13  In  the  text  we  gave  a  linear  time  algorithm  for  computing  a  random 
permutation.  The  algorithm  needed  a  random  number  generator  that  can 
produce  a  random  integer  between  1  and  n  in  constant  time.  Now  assume 
we  have  a  restricted  random  number  generator  available  that  can  only 


generate  a  random  bit  (0  or  1)  in  constant  time.  How  can  we  generate  a 
random  permutation  with  this  restricted  random  number  generator?  What 
is  the  running  time  of  your  procedure? 

4.14  Here  is  a  paranoid  algorithm  to  compute  the  maximum  of  a  set  A  of  n  real 
numbers: 

Algorithm  Fa r a x O idM a x i m u M (A ) 

1.  ifcard(A)  =  l 

2.  then  return  the  unique  element  x  G  A 

3.  else  Pick  a  random  element  x  from  A. 

4.  x'  <—  ParanoidMaximumIA  \  {x}) 

5.  if  x  <  x' 

6.  then  return  x' 

7.  else  Now  we  suspect  that  x  is  the  maximum,  but  to  be 

absolutely  sure,  we  compare  x  with  all  card(A)  —  1 
other  elements  of  A. 

8.  return  x 

What  is  the  worst-case  running  time  of  this  algorithm?  What  is  the 
expected  running  time  (with  respect  to  the  random  choice  in  line  3)? 

4.15  A  simple  polygon  CP  is  called  star-shaped  if  it  contains  a  point  q  such 
that  for  any  point  p  in  CP  the  line  segment  ~pq  is  contained  in  CP.  Give 
an  algorithm  whose  expected  running  time  is  linear  to  decide  whether  a 
simple  polygon  is  star-shaped. 

4.16  On  n  parallel  railway  tracks  n  trains  are  going  with  constant  speeds  vi, 
V2, ....  vn.  At  time  t  =  0  the  trains  are  at  positions  k\,k.2,  ■  ■  . ,  kn.  Give  an 
O(n\ogn)  algorithm  that  detects  all  trains  that  at  some  moment  in  time 
are  leading.  To  this  end,  use  the  algorithm  for  computing  the  intersection 
of  half-planes. 

4.17*  Show  how  to  implement  MlNlDlSC  using  a  single  routine  MlNlDlSC- 
WithPointsIP. /?)  that  computes  nid(P.R)  as  defined  in  Lemma  4.14. 
Your  algorithm  should  compute  only  a  single  random  permutation  during 
the  whole  computation. 
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5  Orthogonal  Range  Searching 

Querying  a  Database 


At  first  sight  it  seems  that  databases  have  little  to  do  with  geometry.  Nevertheless, 
many  types  of  questions — from  now  on  called  queries — about  data  in  a  database 
can  be  interpreted  geometrically.  To  this  end  we  transform  records  in  a  database 
into  points  in  a  multi-dimensional  space,  and  we  transform  the  queries  about 
the  records  into  queries  on  this  set  of  points.  Let’s  demonstrate  this  with  an 
example. 


salary 


Figure  5. 1 

Interpreting  a  database  query 
geometrically 


Consider  a  database  for  personnel  administration.  In  such  a  database  the 
name,  address,  date  of  birth,  salary,  and  so  on,  of  each  employee  are  stored.  A 
typical  query  one  may  want  to  perform  is  to  report  all  employees  born  between 
1950  and  1955  who  earn  between  $3,000  and  $4,000  a  month.  To  formulate 
this  as  a  geometric  problem  we  represent  each  employee  by  a  point  in  the 
plane.  The  first  coordinate  of  the  point  is  the  date  of  birth,  represented  by  the 
integer  10,000  x  year+  100  x  month  +  day ,  and  the  second  coordinate  is  the 
monthly  salary.  With  the  point  we  also  store  the  other  information  we  have 
about  the  employee,  such  as  name  and  address.  The  database  query  asking 
for  all  employees  born  between  1950  and  1955  who  earn  between  $3,000  and 
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19,500,000  19,559,999 


$4,000  transforms  into  the  following  geometric  query:  report  all  points  whose 
first  coordinate  lies  between  19,500,000  and  19,559,999,  and  whose  second 
coordinate  lies  between  3,000  and  4,000.  In  other  words,  we  want  to  report  all 
the  points  inside  an  axis-parallel  query  rectangle — see  Figure  5.1. 

What  if  we  also  have  information  about  the  number  of  children  of  each 
employee,  and  we  would  like  to  be  able  to  ask  queries  like  “report  all  employees 
born  between  1950  and  1955  who  earn  between  $3,000  and  $4,000  a  month  and 
have  between  two  and  four  children”?  In  this  case  we  represent  each  employee 
by  a  point  in  3-dimensional  space:  the  first  coordinate  represents  the  date  of 
birth,  the  second  coordinate  the  salary,  and  the  third  coordinate  the  number  of 
children.  To  answer  the  query  we  now  have  to  report  all  points  inside  the  axis- 
parallel  box  [19,500,000:  19,559,999]  x  [3,000  :  4,000]  x  [2  :  4],  In  general,  if 
we  are  interested  in  answering  queries  on  d  fields  of  the  records  in  our  database, 
we  transform  the  records  to  points  in  ('/-dimensional  space.  A  query  asking  to 
report  all  records  whose  fields  lie  between  specified  values  then  transforms  to 
a  query  asking  for  all  points  inside  a  ^-dimensional  axis-parallel  box.  Such  a 
query  is  called  a  rectangular  range  query,  or  an  orthogonal  range  query ,  in 
computational  geometry.  In  this  chapter  we  shall  study  data  structures  for  such 
queries. 


5.1  1-Dimensional  Range  Searching 

Before  we  try  to  tackle  the  2-  or  higher-dimensional  rectangular  range  searching 
problem,  let’s  have  a  look  at  the  1 -dimensional  version.  The  data  we  are  given 
is  a  set  of  points  in  1 -dimensional  space — in  other  words,  a  set  of  real  numbers. 
A  query  asks  for  the  points  inside  a  1 -dimensional  query  rectangle — in  other 
words,  an  interval  [x  :  x'] . 

Let  P  :=  {p\ ,p2,  . . .  ,pn}  be  the  given  set  of  points  on  the  real  line.  We  can 
solve  the  1 -dimensional  range  searching  problem  efficiently  using  a  well-known 
data  structure:  a  balanced  binary  search  tree  T.  (A  solution  that  uses  an  array  is 
also  possible.  This  solution  does  not  generalize  to  higher  dimensions,  however, 
nor  does  it  allow  for  efficient  updates  on  P.)  The  leaves  of  T  store  the  points 
of  P  and  the  internal  nodes  of  T  store  splitting  values  to  guide  the  search.  We 
denote  the  splitting  value  stored  at  a  node  v  by  xv.  We  assume  that  the  left 
subtree  of  a  node  v  contains  all  the  points  smaller  than  or  equal  to  xv,  and  that 
the  right  subtree  contains  all  the  points  strictly  greater  than  xv- 

To  report  the  points  in  a  query  range  [x  :  x!\  we  proceed  as  follows.  We 
search  with  x  and  x'  in  T.  Let  /i  and  t('  be  the  two  leaves  where  the  searches 
end,  respectively.  Then  the  points  in  the  interval  [x :  x1}  are  the  ones  stored  in  the 
leaves  in  between  /i  and  it '  plus,  possibly,  the  point  stored  at  ti  and  the  point 
stored  at  p'.  When  we  search  with  the  interval  [18  :  77]  in  the  tree  of  Figure  5.2, 
for  instance,  we  have  to  report  all  the  points  stored  in  the  dark  grey  leaves,  plus 
the  point  stored  in  the  leaf  p.  How  can  we  find  the  leaves  in  between  it  and 
p'l  As  Figure  5.2  already  suggests,  they  are  the  leaves  of  certain  subtrees  in 
between  the  search  paths  to  p  and  p ' .  (In  Figure  5.2,  these  subtrees  are  dark 
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grey,  whereas  the  nodes  on  the  search  paths  are  light  grey.)  More  precisely,  the 
subtrees  that  we  select  are  rooted  at  nodes  v  in  between  the  two  search  paths 
whose  parents  are  on  the  search  path.  To  find  these  nodes  we  first  search  for 
the  node  vspiit  where  the  paths  to  x  and  x'  split.  This  is  done  with  the  following 
subroutine.  Let  lc(v )  and  rc(v)  denote  the  left  and  right  child,  respectively,  of  a 
node  v. 

FindSplitNode(T,x,x/) 

Input.  A  tree  T  and  two  values  x  and  x'  with  x  ^  x' . 

Output.  The  node  v  where  the  paths  to  x  and  x'  split,  or  the  leaf  where  both 
paths  end. 

1.  V  < —  roof(T) 

2.  while  v  is  not  a  leaf  and  (x'  <  xv  or  x  >  xv) 

3.  doitV^Xy 

4.  then  v  <—  lc(v) 

5.  else  v  <—  rc(v) 

6.  return  v 

Starting  from  vspijt  we  then  follow  the  search  path  of  x.  At  each  node  where  the 
path  goes  left,  we  report  all  the  leaves  in  the  right  subtree,  because  this  subtree 
is  in  between  the  two  search  paths.  Similarly,  we  follow  the  path  of  xJ  and  we 
report  the  leaves  in  the  left  subtree  of  nodes  where  the  path  goes  right.  Finally, 
we  have  to  check  the  points  stored  at  the  leaves  where  the  paths  end;  they  may 
or  may  not  lie  in  the  range  [x  :  X7]. 

Next  we  describe  the  query  algorithm  in  more  detail.  It  uses  a  subroutine 
ReportSubtree,  which  traverses  the  subtree  rooted  at  a  given  node  and 
reports  the  points  stored  at  its  leaves.  Since  the  number  of  internal  nodes  of  any 
binary  tree  is  less  than  its  number  of  leaves,  this  subroutine  takes  an  amount  of 
time  that  is  linear  in  the  number  of  reported  points. 

Algorithm  1DRangeQuery(7,  [x  :  x']) 

Input.  A  binary  search  tree  T  and  a  range  [x  :  x']. 

Output.  All  points  stored  in  7  that  lie  in  the  range. 

1-  vspiit  <— FindSplitNode(7,x,x') 

2.  if  vspiit  is  a  leaf 

3.  then  Check  if  the  point  stored  at  vspiit  must  be  reported. 
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Figure  5.2 

A  1 -dimensional  range  query  in  a  binary 
search  tree 


root(7) 
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else  (*  Follow  the  path  to  x  and  report  the  points  in  subtrees  right  of  the 
path.  *) 

V  <-  /c(VSpiit) 

while  v  is  not  a  leaf 

do  if  x  f  xv 

then  Reports UBTREE(rc(v)) 
v  <—  Zc(v) 
else  v  <—  rc(v) 

Check  if  the  point  stored  at  the  leaf  v  must  be  reported. 

Similarly,  follow  the  path  to  x' ,  report  the  points  in  subtrees  left  of 
the  path,  and  check  if  the  point  stored  at  the  leaf  where  the  path 
ends  must  be  reported. 


We  first  prove  the  correctness  of  the  algorithm. 

Lemma  5.1  Algorithm  IDRangeQuery  reports  exactly  those  points  that  lie 
in  the  query  range. 


Proof.  We  first  show  that  any  reported  point  p  lies  in  the  query  range.  If  p  is 
stored  at  the  leaf  where  the  path  to  x  or  to  x'  ends,  then  p  is  tested  explicitly  for 
inclusion  in  the  query  range.  Otherwise,  p  is  reported  in  a  call  to  ReportSub- 
TREE.  Assume  this  call  was  made  when  we  followed  the  path  to  x.  Let  v  be  the 
node  on  the  path  such  that  p  was  reported  in  the  call  REPORTSUBTREE(rc(v)). 
Since  v  and.  hence,  rc(v)  lie  in  the  left  subtree  of  Vspiit,  we  have  p  f  xVsplit. 
Because  the  search  path  of  a'  goes  right  at  vspijt  this  means  that  p  <  x' .  On  the 
other  hand,  the  search  path  of  x  goes  left  at  v  and  p  is  in  the  right  subtree  of  v, 
so  x  <  p.  It  follows  that  p  £  [x  :  x'].  The  proof  that  p  lies  in  the  range  when  it  is 
reported  while  following  the  path  to  xJ  is  symmetrical. 

It  remains  to  prove  that  any  point  p  in  the  range  is  reported.  Let  p  be  the 
leaf  where  p  is  stored,  and  let  v  be  the  lowest  ancestor  of  p  that  is  visited  by  the 
query  algorithm.  We  claim  that  v  =  p,  which  implies  that  p  is  reported.  Assume 
for  a  contradiction  that  v  ^  p.  Observe  that  v  cannot  be  a  node  visited  in  a  call 
to  ReportSubtree,  because  all  descendants  of  such  a  node  are  visited.  Hence, 
v  is  either  on  the  search  path  to  x,  or  on  the  search  path  to  x' ,  or  both.  Because 
all  three  cases  are  similar,  we  only  consider  the  third  case.  Assume  first  that  p 
is  in  the  left  subtree  of  v.  Then  the  search  path  of  x  goes  right  at  V  (otherwise  v 
would  not  be  the  lowest  visited  ancestor).  But  this  implies  that  p  <  x.  Similarly, 
if  p  is  in  the  right  subtree  of  v,  then  the  path  of  x'  goes  left  at  V,  and  p  >  x' .  In 
both  cases,  the  assumption  that  p  lies  in  the  range  is  contradicted.  0 


We  now  turn  our  attention  to  the  performance  of  the  data  structure.  Because 
it  is  a  balanced  binary  search  tree,  it  uses  0(n)  storage  and  it  can  be  built  in 
O(nlogn)  time.  What  about  the  query  time?  In  the  worst  case  all  the  points 
could  be  in  the  query  range.  In  this  case  the  query  time  will  be  ©(h),  which 
seems  bad.  Indeed,  we  do  not  need  any  data  structure  to  achieve  0(h)  query 
time;  simply  checking  all  the  points  against  the  query  range  leads  to  the  same 
result.  On  the  other  hand,  a  query  time  of  ©(h)  cannot  be  avoided  when  we 
have  to  report  all  the  points.  Therefore  we  shall  give  a  more  refined  analysis 


of  the  query  time.  The  refined  analysis  takes  not  only  n,  the  number  of  points  Section  5.2 

in  the  set  P,  into  account,  but  also  k,  the  number  of  reported  points.  In  other  kd-trees 

words,  we  will  show  that  the  query  algorithm  is  output-sensitive,  a  concept  we 
already  encountered  in  Chapter  2. 

Recall  that  the  time  spent  in  a  call  to  ReportSubtree  is  linear  in  the 
number  of  reported  points.  Hence,  the  total  time  spent  in  all  such  calls  is  O(k). 

The  remaining  nodes  that  are  visited  are  nodes  on  the  search  path  of  x  or  x7. 

Because  T  is  balanced,  these  paths  have  length  O(logn).  The  time  we  spend 
at  each  node  is  0(1),  so  the  total  time  spent  in  these  nodes  is  O(logn),  which 
gives  a  query  time  of  Ollogn  +  k). 

The  following  theorem  summarizes  the  results  for  1 -dimensional  range 
searching: 

Theorem  5.2  Let  P  be  a  set  of  n  points  in  l -dimensional  space.  The  set  P 
can  be  stored  in  a  balanced  binary  search  tree,  which  uses  0(n)  storage  and 
has  O(nlogn)  construction  time,  such  that  the  points  in  a  query  range  can  be 
reported  in  time  O(k  +  logn),  where  k  is  the  number  of  reported  points. 


5.2  Kd-Trees 


Now  let’s  go  to  the  2-dimensional  rectangular  range  searching  problem.  Let 
P  be  a  set  of  n  points  in  the  plane.  In  the  remainder  of  this  section  we  assume 
that  no  two  points  in  P  have  the  same  x-coordinate,  and  no  two  points  have  the 
same  y-coordinate.  This  restriction  is  not  very  realistic,  especially  not  if  the 
points  represent  employees  and  the  coordinates  are  things  like  salary  or  number 
of  children.  Fortunately,  the  restriction  can  be  overcome  with  a  nice  trick  that 
we  describe  in  Section  5.5. 

A  2-dimensional  rectangular  range  query  on  P  asks  for  the  points  from  P  lying 
inside  a  query  rectangle  [x  :  x7]  x  [y  :  y'] .  A  point  p  :=  ( px,Py )  lies  inside  this 
rectangle  if  and  only  if 

Px  e  [x :  x']  and  py  €  [y :  /]. 

We  could  say  that  a  2-dimensional  rectangular  range  query  is  composed  of  two 
1 -dimensional  sub-queries,  one  on  the  x-coordinate  of  the  points  and  one  on  the 
y-coordinate. 

In  the  previous  section  we  saw  a  data  structure  for  1 -dimensional  range 
queries.  How  can  we  generalize  this  structure — which  was  just  a  binary  search 
tree — to  2-dimensional  range  queries?  Let’s  consider  the  following  recursive 
definition  of  the  binary  search  tree:  the  set  of  ( 1 -dimensional)  points  is  split  into 
two  subsets  of  roughly  equal  size;  one  subset  contains  the  points  smaller  than  or 
equal  to  the  splitting  value,  the  other  subset  contains  the  points  larger  than  the 
splitting  value.  The  splitting  value  is  stored  at  the  root,  and  the  two  subsets  are 
stored  recursively  in  the  two  subtrees. 

In  the  2-dimensional  case  each  point  has  two  values  that  are  important: 
its  x-  and  its  y-coordinate.  Therefore  we  first  split  on  x-coordinate,  next  on 
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y-coordinate,  then  again  on  x-coordinate,  and  so  on.  More  precisely,  the  process 
is  as  follows.  At  the  root  we  split  the  set  P  with  a  vertical  line  I  into  two  subsets 
of  roughly  equal  size.  The  splitting  line  is  stored  at  the  root.  fle ft,  the  subset  of 
points  to  the  left  or  on  the  splitting  line,  is  stored  in  the  left  subtree,  and  Prigho 
the  subset  to  the  right  of  it,  is  stored  in  the  right  subtree.  At  the  left  child  of  the 
root  we  split  P]eft  into  two  subsets  with  a  horizontal  line;  the  points  below  or  on 
it  are  stored  in  the  left  subtree  of  the  left  child,  and  the  points  above  it  are  stored 
in  the  right  subtree.  The  left  child  itself  stores  the  splitting  line.  Similarly,  the 
set  Prfght  is  split  with  a  horizontal  line  into  two  subsets,  which  are  stored  in  the 
left  and  right  subtree  of  the  right  child.  At  the  grandchildren  of  the  root,  we 
split  again  with  a  vertical  line.  In  general,  we  split  with  a  vertical  line  at  nodes 
whose  depth  is  even,  and  we  split  with  a  horizontal  line  at  nodes  whose  depth  is 
odd.  Figure  5.3  illustrates  how  the  splitting  is  done  and  what  the  corresponding 
binary  tree  looks  like.  A  tree  like  this  is  called  a  kd-tree.  Originally,  the  name 


Figure  5.3 

A  kd-tree:  on  the  left  the  way  the  plane 
is  subdivided  and  on  the  right  the 
corresponding  binary  tree 


stood  for  A-diniensional  tree;  the  tree  we  described  above  would  be  a  2d-tree. 
Nowadays,  the  original  meaning  is  lost,  and  what  used  to  be  called  a  2d-tree  is 
now  called  a  2-dimensional  kd-tree. 

We  can  construct  a  kd-tree  with  the  recursive  procedure  described  below. 
This  procedure  has  two  parameters:  a  set  of  points  and  an  integer.  The  first 
parameter  is  the  set  for  which  we  want  to  build  the  kd-tree;  initially  this  is  the 
set  P.  The  second  parameter  is  depth  of  recursion  or,  in  other  words,  the  depth 
of  the  root  of  the  subtree  that  the  recursive  call  constructs.  The  depth  parameter 
is  zero  at  the  first  call.  The  depth  is  important  because,  as  explained  above, 
it  determines  whether  we  must  split  with  a  vertical  or  a  horizontal  line.  The 
procedure  returns  the  root  of  the  kd-tree. 

Algorithm  B  i:  n .  D  K  D  T  R  LH  ( P.  c/e/i  f /i) 

Input.  A  set  of  points  P  and  the  current  depth  depth. 

Output.  The  root  of  a  kd-tree  storing  P. 

1 .  if  P  contains  only  one  point 

2.  then  return  a  leaf  storing  this  point 

3.  else  if  depth  is  even 

4.  then  Split  P  into  two  subsets  with  a  vertical  line  /  through  the 

median  x-coordinate  of  the  points  in  P.  Let  Pi  be  the  set  of 
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points  to  the  left  of  £  or  on  £,  and  let  Pi  be  the  set  of  points 
to  the  right  of  £ . 

5.  else  Split  P  into  two  subsets  with  a  horizontal  line  i  through 

the  median  y-coordinate  of  the  points  in  P.  Let  P\  be  the 
set  of  points  below  £  or  on  £,  and  let  Pi  be  the  set  of  points 
above  l. 

6.  Vieft  <—  BUILDKDTREE(A,<*pf/!+  1) 

7.  vright  <—  BuildKdTree(P2, depth +1) 

8.  Create  a  node  v  storing  £,  make  Vieft  the  left  child  of  v,  and  make 
Vright  the  right  child  of  v. 

9.  return  v 

The  algorithm  uses  the  convention  that  the  point  on  the  splitting  line — the  one 
determining  the  median  x-  or  y-coordinate — belongs  to  the  subset  to  the  left  of, 
or  below,  the  splitting  line.  For  this  to  work  correctly,  the  median  of  a  set  of  n 
numbers  should  be  defined  as  the  [n/2]-th  smallest  number.  This  means  that 
the  median  of  two  values  is  the  smaller  one,  which  ensures  that  the  algorithm 
terminates. 

Before  we  come  to  the  query  algorithm,  let’s  analyze  the  construction  time 
of  a  2-dimensional  kd-tree.  The  most  expensive  step  that  is  performed  at  every 
recursive  call  is  finding  the  splitting  line.  This  requires  determining  the  median 
x-coordinate  or  the  median  y-coordinate,  depending  on  whether  the  depth  is 
even  or  odd.  Median  finding  can  be  done  in  linear  time.  Linear  time  median 
finding  algorithms,  however,  are  rather  complicated.  A  better  approach  is  to 
presort  the  set  of  points  both  on  x-  and  on  y-coordinate.  The  parameter  set  P  is 
now  passed  to  the  procedure  in  the  form  of  two  sorted  lists,  one  on  x-coordinate 
and  one  on  y-coordinate.  Given  the  two  sorted  lists,  it  is  easy  to  find  the  median 
x-coordinate  (when  the  depth  is  even)  or  the  median  y-coordinate  (when  the 
depth  is  odd)  in  linear  time.  It  is  also  easy  to  construct  the  sorted  lists  for  the 
two  recursive  calls  in  linear  time  from  the  given  lists.  Hence,  the  building  time 
T(n)  satisfies  the  recurrence 


Tin) 


0(1),  if  n  =  1 , 

0(n)+2T(\n/2]),  if  77  >  1, 


which  solves  to  0(7;  log 7*).  This  bound  subsumes  the  time  we  spend  for  presort¬ 
ing  the  points  on  x-  and  y-coordinate. 

To  bound  the  amount  of  storage  we  note  that  each  leaf  in  the  kd-tree  stores 
a  distinct  point  of  P.  Hence,  there  are  n  leaves.  Because  a  kd-tree  is  a  binary 
tree,  and  every  leaf  and  internal  node  uses  0(1)  storage,  this  implies  that  the 
total  amount  of  storage  is  0(?z).  This  leads  to  the  following  lemma. 

Lemma  5.3  A  kd-tree  for  a  set  of  n  points  uses  0(n)  storage  and  can  be  con¬ 
structed  in  0(77  log  77)  time. 


Section  5.2 

KD-TREES 


We  now  turn  to  the  query  algorithm.  The  splitting  line  stored  at  the  root 
partitions  the  plane  into  two  half-planes.  The  points  in  the  left  half-plane  are 
stored  in  the  left  subtree,  and  the  points  in  the  right  half-plane  are  stored  in  the 
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Chapter  5  right  subtree.  In  a  sense,  the  left  child  of  the  root  corresponds  to  the  left  half- 

orthogonal  range  searching  plane  and  the  right  child  corresponds  to  the  right  half-plane.  (The  convention 

used  in  BuildKdTree  that  the  point  on  the  splitting  line  belongs  to  the  left 
subset  implies  that  the  left  half-plane  is  closed  to  the  right  and  the  right  half¬ 
plane  is  open  to  the  left.)  The  other  nodes  in  a  kd-tree  correspond  to  a  region 
of  the  plane  as  well.  The  left  child  of  the  left  child  of  the  root,  for  instance, 
corresponds  to  the  region  bounded  to  the  right  by  the  splitting  line  stored  at 
the  root  and  bounded  from  above  by  the  line  stored  at  the  left  child  of  the  root. 
In  general,  the  region  corresponding  to  a  node  v  is  a  rectangle,  which  can  be 
unbounded  on  one  or  more  sides.  It  is  bounded  by  splitting  lines  stored  at 
ancestors  of  v — see  Figure  5.4.  We  denote  the  region  corresponding  to  a  node 


Figure  5.4 

Correspondence  between  nodes  in  a 
kd-tree  and  regions  in  the  plane 


v  by  region(v).  The  region  of  the  root  of  a  kd-tree  is  simply  the  whole  plane. 
Observe  that  a  point  is  stored  in  the  subtree  rooted  at  a  node  v  if  and  only  if  it 
lies  in  region(v).  For  instance,  the  subtree  of  the  node  v  in  Figure  5.4  stores 
the  points  indicated  as  black  dots.  Therefore  we  have  to  search  the  subtree 
rooted  at  v  only  if  the  query  rectangle  intersects  region(v).  This  observation 
leads  to  the  following  query  algorithm:  we  traverse  the  kd-tree,  but  visit  only 
nodes  whose  region  is  intersected  by  the  query  rectangle.  When  a  region  is 
fully  contained  in  the  query  rectangle,  we  can  report  all  the  points  stored  in 
its  subtree.  When  the  traversal  reaches  a  leaf,  we  have  to  check  whether  the 
point  stored  at  the  leaf  is  contained  in  the  query  region  and,  if  so,  report  it. 
Figure  5.5  illustrates  the  query  algorithm.  (Note  that  the  kd-tree  of  Figure  5.5 
could  not  have  been  constructed  by  Algorithm  BuildKdTree;  the  median 
wasn’t  always  chosen  as  the  split  value.)  The  grey  nodes  are  visited  when  we 
query  with  the  grey  rectangle.  The  node  marked  with  a  star  corresponds  to  a 
region  that  is  completely  contained  in  the  query  rectangle;  in  the  figure  this 
rectangular  region  is  shown  darker.  Hence,  the  dark  grey  subtree  rooted  at  this 
node  is  traversed  and  all  points  stored  in  it  are  reported.  The  other  leaves  that  are 
visited  correspond  to  regions  that  are  only  partially  inside  the  query  rectangle. 
Hence,  the  points  stored  in  them  must  be  tested  for  inclusion  in  the  query  range; 

_  this  results  in  points  p(,  and  p\\  being  reported,  and  points  pj,.  p\2,  and  pn  not 

102  being  reported.  The  query  algorithm  is  described  by  the  following  recursive 


procedure,  which  takes  as  arguments  the  root  of  a  kd-tree  and  the  query  range  R. 
It  uses  a  subroutine  ReportSubtree(v),  which  traverses  the  subtree  rooted 
at  a  node  v  and  reports  all  the  points  stored  at  its  leaves.  Recall  that  lc(v )  and 
rc(v)  denote  the  left  and  right  child  of  a  node  v,  respectively. 

Algorithm  SearchKdTree(v,R) 

Input.  The  root  of  (a  subtree  of)  a  kd-tree,  and  a  range  R. 

Output.  All  points  at  leaves  below  v  that  lie  in  the  range. 

1.  if  v  is  a  leaf 

2.  then  Report  the  point  stored  at  v  if  it  lies  in  R. 

3.  else  if  region(lc(v))  is  fully  contained  in  R 

4.  then  Reports  ubtree(/c(v)) 

5.  else  if  region(lc(v))  intersects  R 

6.  then  SearchKdTree(/c(v),R) 

7.  if  region(rc(v ))  is  fully  contained  in  R 

8.  then  ReportSubtree(Vc(v)) 

9.  else  if  region(rc(v))  intersects  R 

10.  then  SearchKdTree(Vc(v),R) 

The  main  test  the  query  algorithm  performs  is  whether  the  query  range  R 
intersects  the  region  corresponding  to  some  node  v.  To  be  able  to  do  this  test 
we  can  compute  region(v)  for  all  nodes  v  during  the  preprocessing  phase  and 
store  it,  but  this  is  not  necessary:  one  can  maintain  the  current  region  through 
the  recursive  calls  using  the  lines  stored  in  the  internal  nodes.  For  instance,  the 
region  corresponding  to  the  left  child  of  a  node  v  at  even  depth  can  be  computed 
from  region(v)  as  follows: 

region(lc(v))  =  region(v )  (T£(v)left, 


f(v)left 

f(v) 

region(lc(v)) 

region(v 

where  I(v)  is  the  splitting  line  stored  at  v,  and  £( v)le"  is  the  half-plane  to  the 
left  of  and  including  £(v). 
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Observe  that  the  query  algorithm  above  never  assumes  that  the  query  range 
R  is  a  rectangle.  Indeed,  it  works  for  any  other  query  range  as  well. 


We  now  analyze  the  time  a  query  with  a  rectangular  range  takes. 

Lemma  5.4  A  query  with  an  axis-parallel  rectangle  in  a  kd-tree  storing  n  points 
can  be  performed  in  0(\/n  +  k)  time,  where  k  is  the  number  of  reported  points. 

Proof.  First  of  all,  note  that  the  time  to  traverse  a  subtree  and  report  the  points 
stored  in  its  leaves  is  linear  in  the  number  of  reported  points.  Hence,  the  total 
time  required  for  traversing  subtrees  in  steps  4  and  8  is  O(k),  where  k  is  the 
total  number  of  reported  points.  It  remains  to  bound  the  number  of  nodes 
visited  by  the  query  algorithm  that  are  not  in  one  of  the  traversed  subtrees. 
(These  are  the  light  grey  nodes  in  Figure  5.5.)  For  each  such  node  v,  the  query 
range  properly  intersects  region(v),  that  is,  region(v)  is  intersected  by,  but  not 
fully  contained  in  the  range.  In  other  words,  the  boundary  of  the  query  range 
intersects  region(v).  To  analyze  the  number  of  such  nodes,  we  shall  bound  the 
number  of  regions  intersected  by  any  vertical  line.  This  will  give  us  an  upper 
bound  on  the  number  of  regions  intersected  by  the  left  and  right  edge  of  the 
query  rectangle.  The  number  of  regions  intersected  by  the  bottom  and  top  edges 
of  the  query  range  can  be  bounded  in  the  same  way. 

Let  £  be  a  vertical  line,  and  let  O'  be  a  kd-tree.  Let  f  (roof  (O'))  be  the  splitting 
line  stored  at  the  root  of  the  kd-tree.  The  line  i  intersects  either  the  region  to 
the  left  of  f  (root (7))  or  the  region  to  the  right  of  £ (roof(T)),  but  not  both.  This 
observation  seems  to  imply  that  Q(n),  the  number  of  intersected  regions  in  a 
kd-tree  storing  a  set  of  n  points,  satisfies  the  recurrence  Q(n)  =  1  +  Q(n/ 2).  But 
this  is  not  true,  because  the  splitting  lines  are  horizontal  at  the  children  of  the 
root.  This  means  that  if  the  line  £  intersects  for  instance  region(lc(root(7)) ,  then 
it  will  always  intersect  the  regions  corresponding  to  both  children  of  lc(root( T)). 
Hence,  the  recursive  situation  we  get  is  not  the  same  as  the  original  situation, 
and  the  recurrence  above  is  incorrect.  To  overcome  this  problem  we  have  to 
make  sure  that  the  recursive  situation  is  exactly  the  same  as  the  original  situation: 
the  root  of  the  subtree  must  contain  a  vertical  splitting  line.  This  leads  us  to 
redefine  Q(n)  as  the  number  of  intersected  regions  in  a  kd-tree  storing  n  points 
whose  root  contains  a  vertical  splitting  line.  To  write  a  recurrence  for  Q(n)  we 
now  have  to  go  down  two  steps  in  the  tree.  Each  of  the  four  nodes  at  depth 
two  in  the  tree  corresponds  to  a  region  containing  n/4  points.  (To  be  precise,  a 
region  can  contain  at  most  \\n/ 2]/2]  =  [n/4]  points,  but  asymptotically  this 
does  not  influence  the  outcome  of  the  recurrence  below.)  Two  of  the  four  nodes 
correspond  to  intersected  regions,  so  we  have  to  count  the  number  of  intersected 
regions  in  these  subtrees  recursively.  Moreover,  t  intersects  the  region  of  the 
root  and  of  one  of  its  children.  Hence,  Q(n)  satisfies  the  recurrence 


Q(n) 


0(1),  if  n  =  1 , 

2  +  2Q(n/4),  if  n  >  1 . 
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This  recurrence  solves  to  Q(n)  =  0(y/n).  In  other  words,  any  vertical  line 
intersects  0(^/n)  regions  in  a  kd-tree.  In  a  similar  way  one  can  prove  that  the 


total  number  of  regions  intersected  by  a  horizontal  line  is  0{^fn).  The  total 
number  of  regions  intersected  by  the  boundary  of  a  rectangular  query  range  is 
bounded  by  0(y/n )  as  well.  0 

The  analysis  of  the  query  time  that  we  gave  above  is  rather  pessimistic:  we 
bounded  the  number  of  regions  intersecting  an  edge  of  the  query  rectangle  by  the 
number  of  regions  intersecting  the  line  through  it.  In  many  practical  situations 
the  range  will  be  small.  As  a  result,  the  edges  are  short  and  will  intersect  much 
fewer  regions.  For  example,  when  we  search  with  a  range  [x  :  x]  x  [y  :  y] — this 
query  effectively  asks  whether  the  point  (x,y)  is  in  the  set — the  query  time  is 
bounded  by  O(\ogn). 

The  following  theorem  summarizes  the  performance  of  kd-trees. 

Theorem  5.5  A  kd-tree  for  a  set  P  of  n  points  in  the  plane  uses  0(n)  storage 
and  can  be  built  in  0(n  log/;)  time.  A  rectangular  range  query  on  the  kd-tree 
takes  0{\Jn  +  k)  time,  where  k  is  the  number  of  reported  points. 

Kd-trees  can  also  be  used  for  point  sets  in  3-  or  higher-dimensional  space. 
The  construction  algorithm  is  very  similar  to  the  planar  case:  At  the  root,  we 
split  the  set  of  points  into  two  subsets  of  roughly  the  same  size  by  a  hyperplane 
perpendicular  to  the  x\  -axis.  In  other  words,  at  the  root  the  point  set  is  partitioned 
based  on  the  first  coordinate  of  the  points.  At  the  children  of  the  root  the  partition 
is  based  on  the  second  coordinate,  at  nodes  at  depth  two  on  the  third  coordinate, 
and  so  on,  until  at  depth  d  —  1  we  partition  on  the  last  coordinate.  At  depth 
d  we  start  all  over  again,  partitioning  on  first  coordinate.  The  recursion  stops 
when  there  is  only  one  point  left,  which  is  then  stored  at  a  leaf.  Because  a 
cZ-dimensional  kd-tree  for  a  set  of  n  points  is  a  binary  tree  with  n  leaves,  it  uses 
0(n)  storage.  The  construction  time  is  0{n  logn).  (As  usual,  we  assume  d  to 
be  a  constant.) 

Nodes  in  a  t/-dimensional  kd-tree  correspond  to  regions,  as  in  the  plane.  The 
query  algorithm  visits  those  nodes  whose  regions  are  properly  intersected  by 
the  query  range,  and  traverses  subtrees  (to  report  the  points  stored  in  the  leaves) 
that  are  rooted  at  nodes  whose  region  is  fully  contained  in  the  query  range.  It 
can  be  shown  that  the  query  time  is  bounded  by  0(nl~l/d  +  k). 


5.3  Range  Trees 

Kd-trees,  which  were  described  in  the  previous  section,  have  0{\fn  +  k)  query 
time.  So  when  the  number  of  reported  points  is  small,  the  query  time  is  relatively 
high.  In  this  section  we  shall  describe  another  data  structure  for  rectangular 
range  queries,  the  range  tree ,  which  has  a  better  query  time,  namely  (9(log2  n  + 
k).  The  price  we  have  to  pay  for  this  improvement  is  an  increase  in  storage  from 
0(n)  for  kd-trees  to  0(n  log  )  for  range  trees. 


Section  5.3 

RANGE  TREES 


As  we  observed  before,  a  2-dimensional  range  query  is  essentially  composed  of 
two  1 -dimensional  sub-queries,  one  on  the  x-coordinate  of  the  points  and  one 
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Chapter  5  on  the  y-coordinate.  This  gave  us  the  idea  to  split  the  given  point  set  alternately 

orthogonal  range  searching  on  x-  and  y-coordinate,  leading  to  the  kd-tree.  To  obtain  the  range  tree,  we  shall 

use  this  observation  in  a  different  way. 

Let  P  be  a  set  of  11  points  in  the  plane  that  we  want  to  preprocess  for 
rectangular  range  queries.  Let  [x  :  x']  x  [y  :  /]  be  the  query  range.  We  first 
concentrate  on  finding  the  points  whose  x-coordinate  lies  in  [x  :  x7],  the  x- 
interval  of  the  query  rectangle,  and  worry  about  the  y-coordinate  later.  If  we 
only  care  about  the  x-coordinate  then  the  query  is  a  1 -dimensional  range  query. 
In  Section  5.1  we  have  seen  how  to  answer  such  a  query:  with  a  binary  search 
tree  on  the  x-coordinate  of  the  points.  The  query  algorithm  was  roughly  as 
follows.  We  search  with  x  and  x'  in  the  tree  until  we  get  to  a  node  vspiit  where 
the  search  paths  split.  From  the  left  child  of  vspiit  we  continue  the  search  with  x, 
and  at  every  node  v  where  the  search  path  of  x  goes  left,  we  report  all  points 
in  the  right  subtree  of  v.  Similarly,  we  continue  the  search  with  x'  at  the  right 
child  of  vspijt,  and  every  node  v  where  the  search  path  of  x'  goes  right  we 
report  all  points  in  the  left  subtree  of  v.  Finally,  we  check  the  leaves  /i  and  il' 
where  the  two  paths  end  to  see  if  they  contain  a  point  in  the  range.  In  effect,  we 
select  a  collection  of  O(\ogn)  subtrees  that  together  contain  exactly  the  points 
whose  x-coordinate  lies  in  the  x-interval  of  the  query  rectangle. 

Let’s  call  the  subset  of  points  stored  in  the  leaves  of  the  subtree  rooted  at  a 
node  v  the  canonical  subset  of  v.  The  canonical  subset  of  the  root  of  the  tree, 
for  instance,  is  the  whole  set  P.  The  canonical  subset  of  a  leaf  is  simply  the 
point  stored  at  that  leaf.  We  denote  the  canonical  subset  of  node  v  by  P(v).  We 
have  just  seen  that  the  subset  of  points  whose  x-coordinate  lies  in  a  query  range 
can  be  expressed  as  the  disjoint  union  of  O(\ogn)  canonical  subsets;  these  are 
the  sets  P(v)  of  the  nodes  v  that  are  the  roots  of  the  selected  subtrees.  We  are 
not  interested  in  all  the  points  in  such  a  canonical  subset  P(v),  but  only  want  to 
report  the  ones  whose  y-coordinate  lies  in  the  interval  [y  :  y'].  This  is  another 
1 -dimensional  query,  which  we  can  solve,  provided  we  have  a  binary  search  tree 
on  the  y-coordinate  of  the  points  in  P(v)  available.  This  leads  to  the  following 
data  structure  for  rectangular  range  queries  on  a  set  P  of  n  points  in  the  plane. 

■  The  main  tree  is  a  balanced  binary  search  tree  7  built  on  the  x-coordinate  of 
the  points  in  P. 

m  For  any  internal  or  leaf  node  v  in  'T,  the  canonical  subset  P(  v)  is  stored  in 
a  balanced  binary  search  tree  TasSoc(v)  on  the  y-coordinate  of  the  points. 
The  node  v  stores  a  pointer  to  the  root  of  ‘Tassoc(v),  which  is  called  the 
associated  structure  of  V. 

This  data  structure  is  called  a  range  tree.  Figure  5.6  shows  the  structure  of  a 
range  tree.  Data  structures  where  nodes  have  pointers  to  associated  structures 
are  often  called  multi-level  data  structures.  The  main  tree  T  is  then  called 
the  first-level  tree ,  and  the  associated  structures  are  second-level  trees.  Multi¬ 
level  data  structures  play  an  important  role  in  computational  geometry;  more 
examples  can  be  found  in  Chapters  10  and  16. 
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A  range  tree  can  be  constructed  with  the  following  recursive  algorithm,  which 
receives  as  input  the  set  P  :=  {p\,  ...,pn}  of  points  sorted  on  x-coordinate  and 


returns  the  root  of  a  2-dimensional  range  tree  7  of  P.  As  in  the  previous  section, 
we  assume  that  no  two  points  have  the  same  x-  or  y-coordinate.  We  shall  get  rid 
of  this  assumption  in  Section  5.5. 

Algorithm  Build2DRangeTree(P) 

Input.  A  set  P  of  points  in  the  plane. 

Output.  The  root  of  a  2-dimensional  range  tree. 

1.  Construct  the  associated  structure:  Build  a  binary  search  tree  Tassoc  on  the 
set  Py  of  y-coordinates  of  the  points  in  P.  Store  at  the  leaves  of  TasSoc  not 
just  the  y-coordinate  of  the  points  in  Py,  but  the  points  themselves. 

2.  if  P  contains  only  one  point 

3.  then  Create  a  leaf  v  storing  this  point,  and  make  Tass0c  the  associated 

structure  of  v. 

4.  else  Split  P  into  two  subsets;  one  subset  P\e ft  contains  the  points  with 

x-coordinate  less  than  or  equal  to  xmK|,  the  median  x-coordinate, 
and  the  other  subset  Bright  contains  the  points  with  x-coordinate 
larger  than  xm;d. 

5-  Vieft  <—  BUILD2DRANGETREE(7jeft) 

6.  Vright  <—  BUILD2DRANGETREE(Pright) 

7.  Create  a  node  v  storing  xmK|,  make  Vieft  the  left  child  of  v,  make 
Vright  the  right  child  of  v,  and  make  Tassoc  the  associated  structure 
of  v. 

8.  return  v 

Note  that  in  the  leaves  of  the  associated  structures  we  do  not  just  store  the 
y-coordinate  of  the  points  but  the  points  themselves.  This  is  important  because, 
when  searching  the  associated  structures,  we  need  to  report  the  points  and  not 
just  the  y-coordinates. 

Lemma  5.6  A  range  tree  on  a  set  of  n  points  in  the  plane  requires  0(n  log/;) 
storage. 


Proof.  A  point  p  in  P  is  stored  only  in  the  associated  structure  of  nodes  on  the 
path  in  T  towards  the  leaf  containing  p.  Hence,  for  all  nodes  at  a  given  depth  of  T, 
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Chapter  5  the  point  p  is  stored  in  exactly  one  associated  structure.  Because  1 -dimensional 
orthogonal  range  searching  range  trees  use  linear  storage  it  follows  that  the  associated  structures  of  all  nodes 

at  any  depth  of  “J  together  use  0(n)  storage.  The  depth  of  T  is  O(\ogn).  Hence, 
the  total  amount  of  storage  required  is  bounded  by  O(nlogn).  0 

Algorithm  Build2DRangeTree  as  it  is  described  will  not  result  in  the 
optimal  construction  time  of  0(n  logn).  To  obtain  this  we  have  to  be  a  bit 
careful.  Constructing  a  binary  search  tree  on  an  unsorted  set  of  n  keys  takes 
0(n  logn)  time.  This  means  that  constructing  the  associated  structure  in  line  1 
would  take  0(n  logn)  time.  But  we  can  do  better  if  the  points  in  Py  are  presorted 
on  y-coordinate;  then  the  binary  search  tree  can  be  constructed  bottom-up  in 
linear  time.  During  the  construction  algorithm  we  therefore  maintain  the  set  of 
points  in  two  lists,  one  sorted  on  x-coordinate  and  one  sorted  on  y-coordinate. 
This  way  the  time  we  spend  at  a  node  in  the  main  tree  7  is  linear  in  the  size  of 
its  canonical  subset.  This  implies  that  the  total  construction  time  is  the  same  as 
the  amount  of  storage,  namely  O(nlogn).  Since  the  presorting  takes  O(n\ogn) 
time  as  well,  the  total  construction  time  is  again  O(nlogn). 

The  query  algorithm  first  selects  (9(log«)  canonical  subsets  that  together  contain 
the  points  whose  x-coordinate  lie  in  the  range  [x  :  x'] .  This  can  be  done  with 
the  1 -dimensional  query  algorithm.  Of  those  subsets,  we  then  report  the  points 
whose  y-coordinate  lie  in  the  range  [y :  y'] .  For  this  we  also  use  the  1 -dimensional 
query  algorithm;  this  time  it  is  applied  to  the  associated  structures  that  store  the 
selected  canonical  subsets.  Thus  the  query  algorithm  is  virtually  the  same  as 
the  1 -dimensional  query  algorithm  IDRangeQuery;  the  only  difference  is 
that  calls  to  ReportSubtree  are  replaced  by  calls  to  IDRangeQuery. 

Algorithm  2DRangeQuery(T,  [x  :  x']  x  [y  :  y']) 

Input.  A  2-dimensional  range  tree  T  and  a  range  [x :  x']  x  [y  :  y'] . 

Output.  All  points  in  T  that  lie  in  the  range. 

1-  vspii,  <—  FindSplitNode(T,x,x/) 

2.  if  vspiit  is  a  leaf 

3.  then  Check  if  the  point  stored  at  vspi;t  must  be  reported. 

4.  else  (*  Follow  the  path  to  x  and  call  IDRangeQuery  on  the  subtrees 
right  of  the  path.  *) 

5.  v  4-  fc(vSpiit) 

6.  while  V  is  not  a  leaf 

7.  doifx<xv 

8.  then  lDRANGEQUERY(TasSoc(rc(v)),  \y  :  /]) 

9.  v  < —  lc(v ) 

10.  else  v  <—  rc(v) 

1 1 .  Check  if  the  point  stored  at  v  must  be  reported. 

12.  Similarly,  follow  the  path  from  rc(vspijt)  to  x! ,  call  IDRange¬ 
Query  with  the  range  [y  :  y']  on  the  associated  structures  of  sub¬ 
trees  left  of  the  path,  and  check  if  the  point  stored  at  the  leaf  where 
the  path  ends  must  be  reported. 


108 


Lemma  5.7  A  query  with  an  axis-parallel  rectangle  in  a  range  tree  storing  n 
points  takes  <9(log2  n  +  k )  time,  where  k  is  the  number  of  reported  points. 

Proof.  At  each  node  v  in  the  main  tree  T  we  spend  constant  time  to  decide  where 
the  search  path  continues,  and  we  possibly  call  IDRangeQuery.  Theorem  5.2 
states  that  the  time  we  spend  in  this  recursive  call  is  0{\ogn  +kv),  where  kv  is 
the  number  of  points  reported  in  this  call.  Hence,  the  total  time  we  spend  is 

Y^O(\ogn  +  kv), 

V 

where  the  summation  is  over  all  nodes  in  the  main  tree  T  that  are  visited.  Notice 
that  the  sum  )_v  kv  equals  k,  the  total  number  of  reported  points.  Furthermore, 
the  search  paths  of  x  and  xJ  in  the  main  tree  7  have  length  O(\ogn).  Hence, 
£v<9(log«)  =  <9(log2«).  The  lemma  follows.  0 

The  following  theorem  summarizes  the  performance  of  2-dimensional  range 
trees. 

Theorem  5.8  Let  P  be  a  set  of  n  points  in  the  plane.  A  range  tree  for  P  uses 
O(nlogn)  storage  and  can  be  constructed  in  0(n  log/;)  time.  By  querying  this 
range  tree  one  can  report  the  points  in  P  that  lie  in  a  rectangular  query  range  in 
(9  (log 2  n  +  k)  time,  where  k  is  the  number  of  reported  points. 

The  query  time  stated  in  Theorem  5.8  can  be  improved  to  O(\og  n  +  k)  by  a 
technique  called  fractional  cascading.  This  is  described  in  Section  5.6. 


5.4  Higher-Dimensional  Range  Trees 

It  is  fairly  straightforward  to  generalize  2-dimensional  range  trees  to  higher¬ 
dimensional  range  trees.  We  only  describe  the  global  approach. 

Let  P  be  a  set  of  points  in  d -dimensional  space.  We  construct  a  balanced 
binary  search  tree  on  the  first  coordinate  of  the  points.  The  canonical  subset 
P(y)  of  a  node  v  in  this  first-level  tree,  the  main  tree,  consists  of  the  points 
stored  in  the  leaves  of  the  subtree  rooted  at  v.  For  each  node  v  we  construct 
an  associated  structure  Tassoc(v);  the  second-level  tree  Tassoc(v)  is  a  (d  —  1)- 
dimensional  range  tree  for  the  points  in  P(v),  restricted  to  their  last  d  I 
coordinates.  This  (d  —  1) -dimensional  range  tree  is  constructed  recursively  in 
the  same  way:  it  is  a  balanced  binary  search  tree  on  the  second  coordinate  of  the 
points,  in  which  each  node  has  a  pointer  to  a  (d  —  2) -dimensional  range  tree  of 
the  points  in  its  subtree,  restricted  to  the  last  ( d  —  2)  coordinates.  The  recursion 
stops  when  we  are  left  with  points  restricted  to  their  last  coordinate;  these  are 
stored  in  a  1 -dimensional  range  tree — a  balanced  binary  search  tree. 

The  query  algorithm  is  also  very  similar  to  the  2-dimensional  case.  We  use 
the  first-level  tree  to  locate  O(logn)  nodes  whose  canonical  subsets  together 
contain  all  the  points  whose  first  coordinates  are  in  the  correct  range.  These 
canonical  subsets  are  queried  further  by  performing  a  range  query  on  the  cor¬ 
responding  second-level  structures.  In  each  second-level  structure  we  select 


Section  5.4 

HIGHER-DIMENSIONAL  RANGE  TREES 
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Chapter  5  O(logn)  canonical  subsets.  This  means  there  are  0(log2n)  canonical  subsets 
orthogonal  range  searching  in  the  second-level  structures  in  total.  Together,  they  contain  all  points  whose 

first  and  second  coordinate  lie  in  the  correct  ranges.  The  third-level  structures 
storing  these  canonical  subsets  are  then  queried  with  the  range  for  the  third 
coordinate,  and  so  on,  until  we  reach  the  1 -dimensional  trees.  In  these  trees  we 
find  the  points  whose  last  coordinate  lies  in  the  correct  range  and  report  them. 
This  approach  leads  to  the  following  result. 

Theorem  5.9  Let  P  be  a  set  of  n  points  in  d -dimensional  space,  where  d  f  2. 
A  range  tree  for  P  uses  0(nlogd~l  n)  storage  and  it  can  be  constructed  in 
0(n  log'1  1  n)  time.  One  can  report  the  points  in  P  that  lie  in  a  rectangular  query 
range  in  O(logd  n  +  k)  time,  where  k  is  the  number  of  reported  points. 

Proof.  Let  Td(n)  denote  the  construction  time  for  a  range  tree  on  a  set  of  n  points 
in  (7-dimensional  space.  By  Theorem  5.8  we  know  that  72(h)  =  0{n\ogn).  The 
construction  of  a  (7-dimensional  range  tree  consists  of  building  a  balanced  binary 
search  tree,  which  takes  time  0[n  log/;),  and  the  construction  of  associated 
structures.  At  the  nodes  at  any  depth  of  the  first-level  tree,  each  point  is  stored 
in  exactly  one  associated  structure.  The  time  required  to  build  all  associated 
structures  of  the  nodes  at  some  depth  is  0(Td_i(n)),  the  time  required  to  build 
the  associated  structure  of  the  root.  This  follows  because  the  building  time  is  at 
least  linear.  Hence,  the  total  construction  time  satisfies 

Td(n)  =  O (n log n)  +  0 (log n)  ■  Td_x(n). 

Since  72(h)  =  0(n  logn),  this  recurrence  solves  to  0(n  \ogd  The  bound 
on  the  amount  of  storage  follows  in  the  same  way. 

Let  Qd(n)  denote  the  time  spent  in  querying  a  (7-dimensional  range  tree  on 
h  points,  not  counting  the  time  to  report  points.  Querying  the  (7-dimensional 
range  tree  involves  searching  in  a  first-level  tree,  which  takes  time  O(\ogn),  and 
querying  a  logarithmic  number  of  {d  —  1) -dimensional  range  trees.  Hence, 

Qd(n)  =  O(logn)  +  O(logn)  •  Qd-i(n), 

where  Qifi)  =  O(\og2n).  This  recurrence  easily  solves  to  Qd(n)  =  0(\ogd n). 
We  still  have  to  add  the  time  needed  to  report  points,  which  is  bounded  by  0(k). 
The  bound  on  the  query  time  follows.  0 

As  in  the  2-dimensional  case,  the  query  time  can  be  improved  by  a  logarith¬ 
mic  factor — see  Section  5.6. 


5.5  General  Sets  of  Points 

Until  now  we  imposed  the  restriction  that  no  two  points  have  equal  x-  or  y- 
coordinate,  which  is  highly  unrealistic.  Fortunately,  this  is  easy  to  remedy.  The 
crucial  observation  is  that  we  never  assumed  the  coordinate  values  to  be  real 
numbers.  We  only  need  that  they  come  from  a  totally  ordered  universe,  so  that 
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we  can  compare  any  two  coordinates  and  compute  medians.  Therefore  we  can 
use  the  trick  described  next. 

We  replace  the  coordinates,  which  are  real  numbers,  by  elements  of  the  so-called 
composite-number  space.  The  elements  of  this  space  are  pairs  of  reals.  The 
composite  number  of  two  reals  a  and  b  is  denoted  by  (a\b).  We  define  a  total 
order  on  the  composite-number  space  by  using  a  lexicographic  order.  So,  for 
two  composite  numbers  (a\b)  and  ( a'\b '),  we  have 

( a\b )  <  (a'\b')  <£=>  a  <  a'  or  (a  =  a'  and  b  <  b'). 

Now  assume  we  are  given  a  set  P  of  n  points  in  the  plane.  The  points  are 
distinct,  but  many  points  can  have  the  same  x-  or  y-coordinate.  We  replace  each 
point  p  :=  ( px,Py )  by  a  new  point  p  :=  ((px\Py),  (Py\Px))  that  has  composite 
numbers  as  coordinate  values.  This  way  we  obtain  a  new  set  P  of  n  points. 
The  first  coordinate  of  any  two  points  in  P  are  distinct;  the  same  holds  true  for 
the  second  coordinate.  Using  the  order  defined  above  one  can  now  construct 
kd-trees  and  2-dimensional  range  trees  for  P. 

Now  suppose  we  want  to  report  the  points  of  P  that  lie  in  a  range  R  : 

[x :  x']  x  [y :  /].  To  this  end  we  must  query  the  tree  we  have  constructed  for  P. 
This  means  that  we  must  also  transform  the  query  range  to  our  new  composite 
space.  The  transformed  range  R  is  defined  as  follows; 

R  :=  [0| -°°)  :  0'l+°°)]  x  [01 -°°) :  (/l+°°)]- 

It  remains  to  prove  that  our  approach  is  correct,  that  is,  that  the  points  of  P  that 
we  report  when  we  query  with  R  correspond  exactly  to  the  points  of  P  that  lie  in 
R. 

Lemma  5.10  Let  p  be  a  point  and  R  a  rectangular  range.  Then 

pGR  pGR. 

Proof.  Let  R  :=  [. x :  x']  x  [y  :  y']  and  let  p  :=  (px.py).  By  definition,  p  lies  in  R 
if  and  only  if  x  ^  px  ^  x'  and  y  ^  py  ^  /.  This  is  easily  seen  to  hold  if  and  only 

if  01  -°°)  <  (Px\Py)  <  01  +°°)  and  01  -°°)  <  (Py\Px)  <  01  +°°)» that  is,  if 

and  only  if  p  lies  in  R.  0 

We  can  conclude  that  our  approach  is  indeed  correct:  we  will  get  the  correct 
answer  to  a  query.  Observe  that  there  is  no  need  to  actually  store  the  transformed 
points:  we  can  just  store  the  original  points,  provided  that  we  do  comparisons 
between  two  v-coordinates  or  two  y-coordi nates  in  the  composite  space. 

The  approach  of  using  composite  numbers  can  also  be  used  in  higher  dimen¬ 
sions. 


Section  5.6* 

FRACTIONAL  CASCADING 


5.6*  Fractional  Cascading 

In  Section  5.3  we  described  a  data  structure  for  rectangular  range  queries  in  the 
plane,  the  range  tree,  whose  query  time  is  <9(log2  n  +  k).  (Here  n  is  the  total 
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Chapter  5  number  of  points  stored  in  the  data  structure,  and  k  is  the  number  of  reported 
orthogonal  range  searching  points.)  In  this  section  we  describe  a  technique,  called  fractional  cascading ,  to 

reduce  the  query  time  to  (9 (  log n  +  k). 

Let’s  briefly  recall  how  a  range  tree  works.  A  range  tree  for  a  set  P  of  points 
in  the  plane  is  a  two-level  data  structure.  The  main  tree  is  a  binary  search 
tree  on  the  x-ccordinate  of  the  points.  Each  node  v  in  the  main  tree  has  an 
associated  structure  ‘Tassoc(v),  which  is  a  binary  search  tree  on  the  y-coordinate 
of  the  points  in  P(v).  the  canonical  subset  of  v.  A  query  with  a  rectangular 
range  [x  :  x'}  x  [y  :  y'\  is  performed  as  follows:  First,  a  collection  of  O(\ogn) 
nodes  in  the  main  tree  is  identified  whose  canonical  subsets  together  contain  the 
points  with  x-coordinate  in  the  range  [x :  x'] .  Second,  the  associated  structures  of 
these  nodes  are  queried  with  the  range  [y  :  y'\.  Querying  an  associated  structure 
‘Jassoc(v)  is  a  1-dimensional  range  query,  so  it  takes  O(\ogn  +  kv)  time,  where 
kv  is  the  number  of  reported  points.  Hence,  the  total  query  time  is  <9(log2  n  +  k). 

If  we  could  perform  the  searches  in  the  associated  structures  in  0(1  +kv) 
time,  then  the  total  query  time  would  reduce  to  O(\ogn+k).  But  how  can  we 
do  this?  In  general,  it  is  not  possible  to  answer  a  1 -dimensional  range  query  in 
0(1  +  k)  time,  with  k  the  number  of  answers.  What  saves  us  is  that  we  have  to 
do  many  1 -dimensional  searches  with  the  same  range,  and  that  we  can  use  the 
result  of  one  search  to  speed  up  other  searches. 

We  first  illustrate  the  idea  of  fractional  cascading  with  a  simple  example.  Let  S \ 
and  Si  be  two  sets  of  objects,  where  each  object  has  a  key  that  is  a  real  number. 
These  sets  are  stored  in  sorted  order  in  arrays  A  i  and  A 2 ■  Suppose  we  want  to 
report  all  objects  in  Si  and  in  Si  whose  keys  lie  in  a  query  interval  [y  : y'\ .  We 
can  do  this  as  follows:  we  do  a  binary  search  with  y  in  A\  to  find  the  smallest 
key  larger  than  or  equal  to  y.  From  there  we  walk  through  the  array  to  the  right, 
reporting  the  objects  we  pass,  until  a  key  larger  than  y'  is  encountered.  The 
objects  from  Si  can  be  reported  in  a  similar  fashion.  If  the  total  number  of 
reported  objects  is  k,  then  the  query  time  will  be  0(k)  plus  the  time  for  two 
binary  searches,  one  in  A\  and  one  in  A2.  If,  however,  the  keys  of  the  objects  in 
Si  are  a  subset  of  the  keys  of  the  objects  in  Si,  then  we  can  avoid  the  second 
binary  search  as  follows.  We  add  pointers  from  the  entries  in  A]  to  the  entries 
in  ,4i:  if  A 1  [/]  stores  an  object  with  key  y,,  then  we  store  a  pointer  to  the  entry 
in  A 2  with  the  smallest  key  larger  than  or  equal  to  y,.  If  there  is  no  such  key 
then  the  pointer  from  A\ [z]  is  nil.  Figure  5.7  illustrates  this.  (Only  the  keys  are 
shown  in  this  figure,  not  the  corresponding  objects.) 

How  can  we  use  this  structure  to  report  the  objects  in  S 1  and  S2  whose  keys 
are  in  a  query  interval  [y  :  /]?  Reporting  the  objects  in  Si  is  still  done  as  before: 
do  a  binary  search  with  v  i  11  A  \ ,  and  walk  through  A\  to  the  right  until  a  key 
larger  than  y'  is  encountered.  To  report  the  points  from  Si  we  proceed  as  follows. 
Let  the  search  fory  i  11  A  \  end  at  A  [i] .  Hence,  the  key  of  A  [z]  is  the  smallest  one 
in  Si  that  is  larger  than  or  equal  to  y.  Since  the  keys  from  Si  form  a  subset  of 
the  keys  from  Si,  this  means  that  the  pointer  from  A[i]  must  point  to  the  smallest 
key  from  Si  larger  than  or  equal  to  y.  Hence,  we  can  follow  this  pointer,  and 
from  there  start  to  walk  to  the  right  through  A 2.  This  way  the  binary  search  in 
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Figure  5. 7 

Speeding  up  the  search  by  adding 
pointers 


A 2  is  avoided,  and  reporting  the  objects  from  S2  takes  only  0(1  +k)  time,  with 
k  the  number  of  reported  answers. 

Figure  5.7  shows  an  example  of  a  query.  We  query  with  the  range  [20  :  65]. 
First  we  use  binary  search  in  Aj  to  find  23,  the  smallest  key  larger  than  or  equal 
to  20.  From  there  we  walk  to  the  right  until  we  encounter  a  key  larger  than 
65.  The  objects  that  we  pass  have  their  keys  in  the  range,  so  they  are  reported. 
Then  we  follow  the  pointer  from  23  into  A 2.  We  get  to  the  key  30,  which  is  the 
smallest  one  larger  than  or  equal  to  20  in  A2.  From  there  we  also  walk  to  the 
right  until  we  reach  a  key  larger  than  65,  and  report  the  objects  from  S 2  whose 
keys  are  in  the  range. 

Now  let’s  return  to  range  trees.  The  crucial  observation  here  is  that  the  canonical 
subsets  P(lc(v))  and  P(rc(v ))  both  are  subsets  of  P(v).  As  a  result  we  can 
use  the  same  idea  to  speed  up  the  query  time.  The  details  are  slightly  more 
complicated,  because  we  now  have  two  subsets  of  P(v)  to  which  we  need 
fast  access  rather  than  only  one.  Let  O'  be  a  range  tree  on  a  set  P  of  11  points 
in  the  plane.  Each  canonical  subset  P(v)  is  stored  in  an  associated  structure. 
But  instead  of  using  a  binary  search  tree  as  associated  structure,  as  we  did 
in  Section  5.3,  we  now  store  it  in  an  array  A(v).  The  array  is  sorted  on  the 
y-coordinate  of  the  points.  Furthermore,  each  entry  in  an  array  A(v)  stores  two 
pointers:  a  pointer  into  A(lc(v))  and  a  pointer  into  ,4 (ref  v) ) .  More  precisely, 
we  add  the  following  pointers.  Suppose  that  A(v)  [z]  stores  a  point  p.  Then  we 
store  a  pointer  from  A(v)  [/]  to  the  entry  of  A(/c(v))  such  that  the  y-coordinate  of 
the  point  p'  stored  there  is  the  smallest  one  larger  than  or  equal  to  py.  As  noted 
above,  P(lc(v))  is  a  subset  of  P(v).  Hence,  if  p  has  the  smallest  y-coordinate 
larger  than  or  equal  to  some  value  y  of  any  point  in  P(v),  then  p'  has  the  smallest 
y-coordinate  larger  than  or  equal  to  y  of  any  point  in  P(lc(v)).  The  pointer 
into  A(rc(v))  is  defined  in  the  same  way:  it  points  to  the  entry  such  that  the 
y-coordinate  of  the  point  stored  there  is  the  smallest  one  that  is  larger  than  or 
equal  to  py. 

This  modified  version  of  the  range  tree  is  called  a  layered  range  tree ; 
Figures  5.8  and  5.9  show  an  example.  (The  positions  in  which  the  arrays  are 
drawn  corresponds  to  the  positions  of  the  nodes  in  the  tree  they  are  associated 
with:  the  topmost  array  is  associated  with  the  root,  the  left  array  below  it  is 
associated  with  the  left  child  of  the  root,  and  so  on.  Not  all  pointers  are  shown 
in  the  figure.) 

Let’s  see  how  to  answer  a  query  with  a  range  [a  :  xJ]  x  |y  :  y']  in  a  layered 
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Figure  5.8 

The  main  tree  of  a  layered  range  tree: 
the  leaves  show  only  the  x-coordinates; 
the  points  stored  are  given  below 


(2,19)  (7,10)  (12,3)  (17,62)  (21,49)  (41,95)  (58,59)  (93,70) 
(5,80)  (8,37)  (15,99)  (33,30)  (52,23)  (67,89) 


Figure  5.9 

The  arrays  associated  with  the  nodes  in 
the  main  tree,  with  the  y-coordinate  of 
the  points  of  the  canonical  subsets  in 
sorted  order  (not  all  pointers  are  shown) 
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range  tree.  As  before  we  search  with  x  and  xJ  in  the  main  tree  T  to  determine  Section  5.7 

O(logn)  nodes  whose  canonical  subsets  together  contain  the  points  with  x-  notes  and  comments 

coordinate  in  the  range  [x  :  x'].  These  nodes  are  found  as  follows.  Let  vspiit  be 

the  node  where  the  two  search  paths  split.  The  nodes  that  we  are  looking  for 

are  the  ones  below  vspi;t  that  are  the  right  child  of  a  node  on  the  search  path  to  x 

where  the  path  goes  left,  or  the  left  child  of  a  node  on  the  search  path  to  xJ  where 

the  path  goes  right.  At  vspi;t  we  find  the  entry  in  A(vspijt)  whose  y-coordinate  is 

the  smallest  one  larger  than  or  equal  to  y.  This  can  be  done  in  O(\ogn)  time  by 

binary  search.  While  we  search  further  with  x  and  xJ  in  the  main  tree,  we  keep 

track  of  the  entry  in  the  associated  arrays  whose  y-coordinate  is  the  smallest  one 

larger  than  or  equal  to  y.  They  can  be  maintained  in  constant  time  by  following 

the  pointers  stored  in  the  arrays.  Now  let  v  be  one  of  the  O(logn)  nodes  we 

selected.  We  must  report  the  points  stored  in  A(v)  whose  y-coordinate  is  in 

the  range  [y  :  y'].  For  this  it  suffices  to  be  able  to  find  the  point  with  smallest 

y-coordinate  larger  than  or  equal  to  y;  from  there  we  can  just  walk  through  the 

array,  reporting  points  as  long  as  their  y-coordinate  is  less  than  or  equal  to  y'. 

This  point  can  be  found  in  constant  time,  because  parent(v)  is  on  the  search 
path,  and  we  kept  track  of  the  points  with  smallest  y-coordinate  larger  than  or 
equal  to  y  in  the  arrays  on  the  search  path.  Hence,  we  can  report  the  points  of 
A(v)  whose  y-coordinate  is  in  the  range  [y  :  y']  in  0(1  +kv )  time,  where  kv  is 
the  number  of  reported  answers  at  node  v.  The  total  query  time  now  becomes 
0(logn  +  k). 

Fractional  cascading  also  improves  the  query  time  of  higher-dimensional  range 
trees  by  a  logarithmic  factor.  Recall  that  a  d -dimensional  range  query  was 
solved  by  first  selecting  the  points  whose  d/-th  coordinate  is  in  the  correct  range 
in  O(\ogn)  canonical  subsets,  and  then  solving  a  (d  —  l)-dimensional  query 
on  these  subsets.  The  (d  —  1) -dimensional  query  is  solved  recursively  in  the 
same  way.  This  continues  until  we  arrive  at  a  2-dimensional  query,  which  can 
be  solved  as  described  above.  This  leads  to  the  following  theorem. 

Theorem  5.11  Let  P  be  a  set  of  n  points  in  d -dimensional  space,  with  d  ^  2.  A 
layered  range  tree  for  P  uses  0(nlogd  1  n)  storage  and  it  can  be  constructed  in 
0(n  log*7  1  n)  time.  With  this  range  tree  one  can  report  the  points  in  P  that  lie 
in  a  rectangular  query  range  in  0(logd  1  n  +  k)  time,  where  k  is  the  number  of 
reported  points. 


5.7  Notes  and  Comments 

In  the  1970s — the  early  days  of  computational  geometry — orthogonal  range 
searching  was  one  of  the  most  important  problems  in  the  field,  and  many  people 
worked  on  it.  This  resulted  in  a  large  number  of  results,  of  which  we  discuss  a 
few  below. 

One  of  the  first  data  structures  for  orthogonal  range  searching  was  the 
quadtree,  which  is  discussed  in  Chapter  14  in  the  context  of  meshing.  Un¬ 
fortunately,  the  worst-case  behavior  of  quadtrees  is  quite  bad.  Kd-trees,  de- 
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Chapter  5  scribed  first  by  Bentley  [44]  in  1975,  are  an  improvement  over  quadtrees. 
orthogonal  range  searching  Samet’s  books  [333,  334]  discuss  quadtrees,  kd-trees,  and  their  applications 

in  great  detail.  A  few  years  later,  the  range  tree  was  discovered  indepen¬ 
dently  by  several  people  [46,  251,  261,  387].  The  improvement  in  query  time 
to  0(k)gn  +  k)  by  fractional  cascading  was  described  by  Lueker  [261]  and 
Willard  [386].  Fractional  cascading  applies  in  fact  not  only  to  range  trees,  but 
in  many  situations  where  one  does  many  searches  with  the  same  key.  Chazelle 
and  Guibas  [105,  106]  discuss  this  technique  in  its  full  generality.  Fractional 
cascading  can  also  be  used  in  a  dynamic  setting  [275].  The  most  efficient 
data  structure  for  2-dimensional  range  queries  is  a  modified  version  of  the 
layered  range  tree,  described  by  Chazelle  [87];  he  succeeded  in  improving 
the  storage  to  0(«logn/loglogn)  while  keeping  the  query  time  O(\ogn  +  k). 
Chazelle  [90,  91]  also  proved  that  this  is  optimal.  If  the  query  range  is  un¬ 
bounded  to  one  side,  for  instance  when  it  is  of  the  form  [x  :  x!\  x  [y  :  +°°], 
then  one  can  achieve  O(logn)  query  time  with  only  linear  space,  using  a  pri¬ 
ority  search  tree — see  Chapter  10.  In  higher  dimensions  the  best  result  for 
orthogonal  range  searching  is  also  due  to  Chazelle  [90];  he  gave  a  structure  for 
cZ-dimensional  queries  with  0{n(\ogn/  \og\ogn)d^1)  storage  and  polylogarith- 
mic  query  time.  Again,  this  result  is  optimal.  Trade-offs  between  storage  and 
query  time  are  also  possible  [338,  391], 

The  lower-bound  result  is  only  valid  under  certain  models  of  computation. 
This  allows  for  improved  results  in  specific  cases.  In  particular.  Overmars  [300] 
describes  more  efficient  data  structures  for  range  searching  when  the  points  lie 
on  a  U  x  U  grid,  yielding  query  time  bounds  of  0(loglogU  +  k)  or  0(\fU  +  k), 
depending  on  the  preprocessing  time  allowed.  The  results  use  data  structures 
described  earlier  by  Willard  [389,  390].  When  compared  with  the  general  case, 
better  time  bounds  can  be  obtained  for  many  problems  in  computational  ge¬ 
ometry  once  the  coordinates  of  the  objects  are  restricted  to  lie  on  grid  points. 
Examples  are  the  nearest  neighbor  searching  problem  [224,  225],  point  loca¬ 
tion  [287],  and  line  segment  intersection  [226]. 

In  databases,  range  queries  are  considered  the  most  general  of  three  basic  types 
of  multi-dimensional  queries.  The  two  other  types  are  exact  match  queries 
and  partial  match  queries.  Exact  match  queries  are  simply  queries  of  the 
type:  Is  the  object  (point)  with  attribute  values  (coordinates)  such  and  such 
present  in  the  database?  The  obvious  data  structure  for  exact  match  queries 
is  the  balanced  binary  tree  that  uses,  for  instance,  a  lexicographical  order  on 
the  coordinates.  With  this  structure  exact  match  queries  can  be  answered  in 
O(\ogn)  time.  If  the  dimension — the  number  of  attributes — increases,  it  can 
be  useful  to  express  the  efficiency  of  queries  not  only  in  terms  of  n,  but  also 
in  the  dimension  d.  If  a  binary  tree  is  used  for  exact  match  queries,  the  query 
time  is  Oid logn)  because  it  takes  0(d)  time  to  compare  two  points.  This  can 
easily  be  reduced  to  0(d  +  logn)  time,  which  is  optimal.  A  partial  match  query 
specifies  only  a  value  for  a  subset  of  the  coordinates  and  asks  for  all  points  with 
the  specified  coordinate  values.  In  the  plane,  for  instance,  a  partial  match  query 
specifies  only  an  x-coordinate,  or  only  a  y-coordinate.  Interpreted  geometrically, 
a  partial  match  query  in  the  plane  asks  for  the  points  on  a  horizontal  line,  or  on 
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a  vertical  line.  With  a  ^-dimensional  kd-tree,  a  partial  match  query  specifying  s 
coordinates  (with  s  <  d)  can  be  answered  in  0(nx~sld  +  k )  time,  where  k  is  the 
number  of  reported  points  [44] . 

In  many  applications  the  data  that  we  are  given  are  not  a  set  of  points,  but  a 
set  of  certain  objects  such  as  polygons.  If  we  want  to  report  the  objects  that 
are  completely  contained  in  a  query  range  [x  :  x!\  x  [y  :  y'\,  then  it  is  possible 
to  transform  the  query  to  a  query  on  point  data  in  higher  dimensions — see 
Exercise  5.13.  Often  one  also  wants  to  find  the  objects  that  are  only  partially 
in  the  range.  This  specific  problem  is  called  the  windowing  problem  and  is 
discussed  in  Chapter  10. 

Other  variants  of  the  range  searching  problem  are  obtained  by  allowing 
other  types  of  query  ranges,  such  as  circles  or  triangles.  Many  of  these  variants 
can  be  solved  using  so-called  partition  trees,  which  are  discussed  in  Chapter  16. 


5.8  Exercises 


5.1 


In  the  proof  of  the  query  time  of  the  kd-tree  we  found  the  following 
recurrence: 
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0(1),  if  n  =  1, 

2  +  2Q(n/4),  if  «  >  1 . 


Prove  that  this  recurrence  solves  to  Q(n)  =  0(v/n).  Also  show  that 
Q(y/n)  is  a  lower  bound  for  querying  in  a  kd-tree  by  defining  a  set  of  n 
points  and  a  query  rectangle  appropriately. 


5.2  Describe  algorithms  to  insert  and  delete  points  from  a  kd-tree.  In  your 
algorithm  you  do  not  need  to  take  care  of  rebalancing  the  structure. 


5.3  In  Section  5.2  it  was  indicated  that  kd-trees  can  also  be  used  to  store 
sets  of  points  in  higher-dimensional  space.  Let  P  be  a  set  of  n  points 
in  d -dimensional  space.  In  parts  a.  and  b.  you  may  consider  d  to  be  a 
constant. 

a.  Describe  an  algorithm  to  construct  a  d -dimensional  kd-tree  for  the 
points  in  P.  Prove  that  the  tree  uses  linear  storage  and  that  your 
algorithm  takes  0(n  log/;]  time. 

b.  Describe  the  query  algorithm  for  performing  a  d -dimensional  range 
query.  Prove  that  the  query  time  is  bounded  by  0(nl~1^d  +  k). 

c.  Show  that  the  dependence  on  d  in  the  amount  of  storage  is  linear,  that 
is,  show  that  the  amount  of  storage  is  0(dn)  if  we  do  not  consider  d  to 
be  a  constant.  Give  the  dependence  on  d  of  the  construction  time  and 
the  query  time  as  well. 


5.4  Kd-trees  can  be  used  for  partial  match  queries.  A  2-dimensional  partial 
match  query  specifies  a  value  for  one  of  the  coordinates  and  asks  for 
all  points  that  have  that  value  for  the  specified  coordinate.  In  higher 
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Chapter  5  dimensions  we  specify  values  for  a  subset  of  the  coordinates.  Here  we 

orthogonal  range  searching  allow  multiple  points  to  have  equal  values  for  coordinates. 

a.  Show  that  2-dinrensional  kd-trees  can  answer  partial  match  queries  in 
0(\Jn  +  k)  time,  where  k  is  the  number  of  reported  answers. 

b.  Explain  how  to  use  a  2-dimensional  range  tree  to  answer  partial  match 
queries.  What  is  the  resulting  query  time? 

c.  Describe  a  data  structure  that  uses  linear  storage  and  solves  2-dimen¬ 
sional  partial  match  queries  in  O(\ogn  +  k)  time. 

d.  Show  that  with  a  cZ-dimensional  kd-tree  we  can  solve  a  cZ-dimensional 
partial  match  query  in  0(nx~s!d  +  k)  time,  where  s  (with  s  <d)  is  the 
number  of  specified  coordinates. 

e.  Describe  a  data  structure  that  uses  linear  storage  and  that  can  answer 
cZ-dimensional  partial  match  queries  in  O(logn  +  k)  time.  Hint:  Use  a 
structure  whose  dependence  on  d  in  the  amount  of  storage  is  exponen¬ 
tial  (more  precisely,  a  structure  that  uses  0(d2dn)  storage). 

5.5  Algorithm  SearchKdTree  can  also  be  used  when  querying  with  other 
ranges  than  rectangles.  For  example,  a  query  is  answered  correctly  if  the 
range  is  a  triangle. 

a.  Show  that  the  query  time  for  range  queries  with  triangles  is  linear  in 
the  worst  case,  even  if  no  answers  are  reported  at  all.  Hint:  Choose  all 
points  to  be  stored  in  the  kd-tree  on  the  line  y  =  x. 

b.  Suppose  that  a  data  structure  is  needed  that  can  answer  triangular  range 
queries,  but  only  for  triangles  whose  edges  are  horizontal,  vertical, 
or  have  slope  +1  or  —1.  Develop  a  linear  size  data  structure  that 
answers  such  range  queries  in  0(ni^4  +  k )  time,  where  k  is  the  number 
of  points  reported.  Hint:  Choose  4  coordinate  axes  in  the  plane  and 
use  a  4-dimensional  kd-tree. 

c.  Improve  the  query  time  to  0(n2/3  +k).  Hint:  Solve  Exercise  5.4  first. 

5.6  Describe  algorithms  to  insert  and  delete  points  from  a  range  tree.  You 
don’t  have  to  take  care  of  rebalancing  the  structure. 

5.7  In  the  proof  of  Lemma  5.7  we  made  a  rather  rough  estimate  of  the  query 
time  in  the  associated  structures  by  stating  that  this  was  bounded  by 
O(\ogn).  In  reality  the  query  time  is  dependent  on  the  actual  number  of 
points  in  the  associated  structure.  Let  nv  denote  the  number  of  points  in 
the  canonical  subset  P(v).  Then  the  total  time  spent  is 

£©(log  nv  +  kv), 

V 

where  the  summation  is  over  all  nodes  in  the  main  tree  T  that  are  vis¬ 
ited.  Show  that  this  bound  is  still  ©(log2n  +  k).  (That  is,  our  more 
careful  analysis  only  improves  the  constant  in  the  bound,  not  the  order  of 
magnitude.) 
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5.8  Theorem  5.8  showed  that  a  range  tree  on  a  set  of  n  points  in  the  plane 
requires  O(n\ogn)  storage.  One  could  bring  down  the  storage  require- 


ments  by  storing  associated  structures  only  with  a  subset  of  the  nodes  in  Section  5.8 
the  main  tree.  exercises 

a.  Suppose  that  only  the  nodes  with  depth  0,  2, 4,  6, . . .  have  an  associated 
structure.  Show  how  the  query  algorithm  can  be  adapted  to  answer 
queries  correctly. 

b.  Analyze  the  storage  requirements  and  query  time  of  such  a  data  struc¬ 
ture. 

c.  Suppose  that  only  the  nodes  with  depth  0,  [  j  log nj ,  [ 3  log nj ,  ...  have 
an  associated  structure,  where  j  ^  2  is  a  constant.  Analyze  the  storage 
requirements  and  query  time  of  this  data  structure.  Express  the  bounds 
in  n  and  j. 

5.9  One  can  use  the  data  structures  described  in  this  chapter  to  determine 
whether  a  particular  point  (a,  b)  is  in  a  given  set  by  performing  a  range 
query  with  range  [a  :  a]  x  [b  :  b] . 

a.  Prove  that  performing  such  a  range  query  on  a  kd-tree  takes  time 
O(logn). 

b.  What  is  the  time  bound  for  such  a  query  on  a  range  tree?  Prove  your 
answer. 

5.10  In  some  applications  one  is  interested  only  in  the  number  of  points  that 
lie  in  a  range  rather  than  in  reporting  all  of  them.  Such  queries  are  often 
referred  to  as  range  counting  queries.  In  this  case  one  would  like  to  avoid 
having  an  additive  term  of  0(k)  in  the  query  time. 

a.  Describe  how  a  1 -dimensional  range  tree  can  be  adapted  such  that  a 
range  counting  query  can  be  performed  in  0(\ogn)  time.  Prove  the 
query  time  bound. 

b.  Using  the  solution  to  the  1-dimensional  problem,  describe  how  d- 
dimensional  range  counting  queries  can  be  answered  in  O(\ogd  n) 
time.  Prove  the  query  time. 

c. *  Describe  how  fractional  cascading  can  be  used  to  improve  the  running 

time  with  a  factor  of  O(\ogn)  for  2-  and  higher-dimensional  range 
counting  queries. 

5.11  Let  Si  be  a  set  of  n  disjoint  horizontal  line  segments  and  let  Si  be  a  set 
of  m  disjoint  vertical  line  segments.  Give  a  plane-sweep  algorithm  that 
counts  in  0((n  +  m)\og(n  +  m))  time  how  many  intersections  there  are 
in  Si  US2. 

5.12  In  Section  5.5  it  was  shown  that  one  can  treat  sets  of  points  in  the  plane 
that  contain  equal  coordinate  values  using  composite  numbers.  Extend 
this  notion  to  points  in  ^/-dimensional  space.  To  this  end  you  should  define 
the  composite  number  of  d  numbers  and  define  an  appropriate  order  on 
them.  Next,  show  how  to  transform  the  point  p  :=  (p\. ... . p,i )  and  the 
range  R  :=  [n  :  rj]  x  •  •  •  x  [rj  :  r'd\  using  this  order;  the  transformation 

should  be  such  that  p  €  R  if  and  only  if  the  transformed  point  lies  in  the  _ 
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Chapter  5  5.13  In  many  application  one  wants  to  do  range  searching  among  objects  other 

ORTHOGONAL  RANGE  SEARCHING  than  points. 

a.  Let  S  be  a  set  of  n  axis-parallel  rectangles  in  the  plane.  We  want  to 
be  able  to  report  all  rectangles  in  5  that  are  completely  contained  in 
a  query  rectangle  [x  :  x']  x  [y  :  /].  Describe  a  data  structure  for  this 
problem  that  uses  0{n log3 n)  storage  and  has  (9 (log4/;  +  k)  query 
time,  where  k  is  the  number  of  reported  answers.  Hint:  Transform  the 
problem  to  an  orthogonal  range  searching  problem  in  some  higher¬ 
dimensional  space. 

b.  Let  P  consist  of  a  set  of  n  polygons  in  the  plane.  Again  describe 
a  data  structure  that  uses  0(nlog3n)  storage  and  has  (9(log4»  +  k) 
query  time  to  report  all  polygons  completely  contained  in  the  query 
rectangle,  where  k  is  the  number  of  reported  answers. 

c. *  Improve  the  query  time  of  your  solutions  (both  for  a.  and  b.)  to 

(9(log3  n  +  k). 

5.14*  Prove  the  0(nlogd  1  n )  bounds  on  the  storage  and  construction  time  in 

Theorem  5.11. 
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6  Point  Location 


Knowing  Where  You  Are 


This  book  has,  for  the  most  part,  been  written  in  Europe.  More  precisely,  it  has 
been  written  very  close  to  a  point  at  longitude  5°6'  east  and  latitude  52°3'  north. 
Where  that  is?  You  can  find  that  out  yourself  from  a  map  of  Europe:  using  the 
scales  on  the  sides  of  the  map,  you  will  find  that  the  point  with  the  coordinates 
stated  above  is  located  in  a  little  country  named  “the  Netherlands”. 

In  this  way  you  would  have  answered  a  point  location  query.  Given  a  map 
and  a  query  point  q  specified  by  its  coordinates,  find  the  region  of  the  map 
containing  q.  A  map,  of  course,  is  nothing  more  than  a  subdivision  of  the  plane 
into  regions,  a  planar  subdivision,  as  defined  in  Chapter  2. 


Figure  6. 1 

Point  location  in  a  map 


Point  location  queries  arise  in  various  settings.  Suppose  that  you  are  sailing  on 
a  sea  with  sand  banks  and  dangerous  currents  in  various  parts  of  it.  To  be  able 
to  navigate  safely,  you  will  have  to  know  the  current  at  your  present  position. 
Fortunately  there  are  maps  that  indicate  the  kind  of  current  in  the  various  parts 
of  the  sea.  To  use  such  a  map,  you  will  have  to  do  the  following.  First,  you  must 
determine  your  position.  Until  not  so  long  ago,  you  would  have  to  rely  for  this 


121 


Chapter  6 

POINT  LOCATION 


\ 


on  the  stars  or  the  sun,  and  a  good  chronometer.  Nowadays  it  is  much  easier 
to  determine  your  position:  there  are  little  boxes  on  the  market  that  compute 
your  position  for  you,  using  information  from  various  satellites.  Once  you  have 
determined  the  coordinates  of  your  position,  you  will  have  to  locate  the  point  on 
the  map  showing  the  currents,  or  to  find  the  region  of  the  sea  you  are  presently 
in. 

One  step  further  would  be  to  automate  this  last  step:  store  the  map  electroni¬ 
cally,  and  let  the  computer  do  the  point  location  for  you.  It  could  then  display 
the  current — or  any  other  information  for  which  you  have  a  thematic  map  in 
electronic  form — of  the  region  you  are  in  continuously.  In  this  situation  we  have 
a  set  of  presumably  rather  detailed  thematic  maps  and  we  want  to  answer  point 
location  queries  frequently,  to  update  the  displayed  information  while  the  ship 
is  moving.  This  means  that  we  will  want  to  preprocess  the  maps,  and  to  store 
them  in  a  data  structure  that  makes  it  possible  to  answer  point  location  queries 
fast. 

Point  location  problems  arise  on  a  quite  different  scale  as  well.  Assume  that  we 
want  to  implement  an  interactive  geographic  information  system  that  displays  a 
map  on  a  screen.  By  clicking  with  the  mouse  on  a  country,  the  user  can  retrieve 
information  about  that  country.  While  the  mouse  is  moved  the  system  should 
display  the  name  of  the  country  underneath  the  mouse  pointer  somewhere  on 
the  screen.  Every  time  the  mouse  is  moved,  the  system  has  to  recompute  which 
name  to  display.  Clearly  this  is  a  point  location  problem  in  the  map  displayed 
on  the  screen,  with  the  mouse  position  as  the  query  point.  These  queries  occur 
with  high  frequency — after  all,  we  want  to  update  the  screen  information  in  real 
time — and  therefore  have  to  be  answered  fast.  Again,  we  need  a  data  structure 
that  supports  fast  point  location  queries. 


6.1  Point  Location  and  Trapezoidal  Maps 

Let  §  be  a  planar  subdivision  with  n  edges.  The  planar  point  location  problem 
is  to  store  §  in  such  a  way  that  we  can  answer  queries  of  the  following  type: 
Given  a  query  point  q,  report  the  face  /  of  §  that  contains  q.  If  q  lies  on  an  edge 
or  coincides  with  a  vertex,  the  query  algorithm  should  return  this  information. 

To  get  some  insight  into  the  problem,  let’s  first  give  a  very  simple  data  structure 
to  perform  point  location  queries.  We  draw  vertical  lines  through  all  vertices  of 
the  subdivision,  as  in  Figure  6.2.  This  partitions  the  plane  into  vertical  slabs. 
We  store  the  v-coordi nates  of  the  vertices  in  sorted  order  in  an  array.  This 
makes  it  possible  to  determine  in  O(\ogn)  time  the  slab  that  contains  a  query 
point  q.  Within  a  slab,  there  are  no  vertices  of  8.  This  means  that  the  part  of  the 
subdivision  lying  inside  the  slab  has  a  special  form:  all  edges  intersecting  a  slab 
completely  cross  it — they  have  no  endpoint  in  the  slab — and  they  don’t  cross 
each  other.  This  means  that  they  can  be  ordered  from  top  to  bottom.  Notice 
that  every  region  in  the  slab  between  two  consecutive  edges  belongs  to  a  unique 
face  of  8.  The  lowest  and  highest  region  of  the  slab  are  unbounded,  and  are  part 
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Figure  6.2 
Partition  into  slabs 


of  the  unbounded  face  of  S.  The  special  structure  of  the  edges  intersecting  a 
slab  implies  that  we  can  store  them  in  sorted  order  in  an  array.  We  label  each 
edge  with  the  face  of  §  that  is  immediately  above  it  inside  the  slab. 

The  total  query  algorithm  is  now  as  follows.  First,  we  do  a  binary  search 
with  the  x-coordinate  of  the  query  point  q  in  the  array  storing  the  x-coordinates 
of  the  vertices  of  the  subdivision.  This  tells  us  the  slab  containing  q.  Then  we 
do  a  binary  search  with  q  in  the  array  for  that  slab.  The  elementary  operation 
in  this  binary  search  is:  Given  a  segment  s  and  a  point  q  such  that  the  vertical 
line  through  q  intersects  s ,  determine  whether  q  lies  above  s,  below  s,  or  on  s. 
This  tells  us  the  segment  directly  below  q ,  provided  there  is  one.  The  label 
stored  with  that  segment  is  the  face  of  §  containing  q.  If  we  find  that  there  is  no 
segment  below  q  then  q  lies  in  the  unbounded  face. 


The  query  time  for  the  data  structure  is  good:  we  only  perform  two  binary 
searches,  the  first  in  an  array  of  length  at  most  2 n  (the  n  edges  of  the  subdivision 
have  at  most  2 n  vertices),  and  the  second  in  an  array  of  length  at  most  n  (a  slab 
is  crossed  by  at  most  n  edges).  Hence,  the  query  time  is  <9(logn). 

What  about  the  storage  requirements?  First  of  all,  we  have  an  array  on  the 
x-coordinates  of  the  vertices,  which  uses  0(n)  storage.  But  we  also  have  an 
array  for  every  slab.  Such  an  array  stores  the  edges  intersecting  its  slab,  so 
it  uses  O(n)  storage.  Since  there  are  O(n)  slabs,  the  total  amount  of  storage 
is  0{n2).  It’s  easy  to  give  an  example  where  n/4  slabs  are  intersected  by  n /4 
edges  each,  which  shows  that  this  worst-case  bound  can  actually  occur. 

The  amount  of  storage  required  makes  this  data  structure  rather  uninteresting — 
a  quadratic  size  structure  is  useless  in  most  practical  applications,  even  for 
moderately  large  values  of  n.  (One  may  argue  that  in  practice  the  quadratic 
behavior  does  not  occur.  But  it  is  rather  likely  that  the  amount  of  storage  is 
something  like  0(n%/n).)  Where  does  this  quadratic  behavior  come  from?  Let’s 
have  a  second  look  at  Figure  6.2.  The  segments  and  the  vertical  lines  through  the 
endpoints  define  a  new  subdivision  S',  whose  faces  are  trapezoids,  triangles,  and 
unbounded  trapezoid-like  faces.  Furthermore,  S'  is  a  refinement  of  the  original 
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Chapter  6  subdivision  8:  every  face  of  S'  lies  completely  in  one  face  of  8.  The  query 

point  location  algorithm  described  above  is  in  fact  an  algorithm  to  do  planar  point  location  in 

this  refined  subdivision.  This  solves  the  original  planar  point  location  as  well: 
because  S'  is  a  refinement  of  8,  once  we  know  the  face  f  £  S'  containing  q,  we 
know  the  face  /  £  8  containing  q.  Unfortunately,  the  refined  subdivision  can 
have  quadratic  complexity.  It  is  therefore  not  surprising  that  the  resulting  data 
structure  has  quadratic  size. 

Perhaps  we  should  look  for  a  different  refinement  of  8  that — like  the  decom¬ 
position  shown  above — makes  point  location  queries  easier,  and  that — unlike 
the  decomposition  shown  above — has  a  complexity  that  is  not  much  larger  than 
the  complexity  of  the  original  subdivision  8.  Indeed  such  a  refinement  exists. 
In  the  rest  of  this  section,  we  shall  describe  the  trapezoidal  map,  a  refinement 
that  has  the  desirable  properties  just  mentioned. 

We  call  two  line  segments  in  the  plane  non-crossing  if  their  intersection  is  either 
empty  or  a  common  endpoint.  Notice  that  the  edges  of  any  planar  subdivision 
are  non-crossing. 

Let  S  be  a  set  of  n  non-crossing  segments  in  the  plane.  Trapezoidal  maps 
can  be  defined  for  such  sets  in  general,  but  we  shall  make  two  simplifications 
that  make  life  easier  for  us  in  this  and  the  next  sections. 

First,  it  will  be  convenient  to  get  rid  of  the  unbounded  trapezoid-like  faces 
that  occur  at  the  boundary  of  the  scene.  This  can  be  done  by  introducing  a  large, 
axis-parallel  rectangle  R  that  contains  the  whole  scene,  that  is,  that  contains  all 
segments  of  S.  For  our  application — point  location  in  subdivisions — this  is  not 
a  problem:  a  query  point  outside  R  always  lies  in  the  unbounded  face  of  8,  so 
we  can  safely  restrict  our  attention  to  what  happens  inside  R. 

The  second  simplification  is  more  difficult  to  justify:  we  will  assume  that  no 
two  distinct  endpoints  of  segments  in  the  set  S  have  the  same  x-coordinate.  A 
consequence  of  this  is  that  there  cannot  be  any  vertical  segments.  This  assump¬ 
tion  is  not  very  realistic:  vertical  edges  occur  frequently  in  many  applications, 
and  the  situation  that  two  non-intersecting  segments  have  an  endpoint  with  the 
same  x-coordinate  is  not  so  unusual  either,  because  the  precision  in  which  the  co¬ 
ordinates  are  given  is  often  limited.  We  will  make  this  assumption  nevertheless, 
postponing  the  treatment  of  the  general  case  to  Section  6.3. 

So  we  have  a  set  S  of  n  non-crossing  line  segments,  enclosed  in  a  bounding 
box  R,  and  with  the  property  that  no  two  distinct  endpoints  lie  on  a  common 
vertical  line.  We  call  such  a  set  a  set  of  line  segments  in  general  position. 
The  trapezoidal  map  7 (S)  of  S — also  known  as  the  vertical  decomposition  or 
trapezoidal  decomposition  of  S — is  obtained  by  drawing  two  vertical  extensions 
from  every  endpoint  p  of  a  segment  in  S,  one  extension  going  upwards  and  one 
going  downwards.  The  extensions  stop  when  they  meet  another  segment  of  S  or 
the  boundary  of  R.  We  call  the  two  vertical  extensions  starting  in  an  endpoint  p 
the  upper  vertical  extension  and  the  lower  vertical  extension.  The  trapezoidal 
map  of  S  is  simply  the  subdivision  induced  by  S,  the  rectangle  R,  and  the  upper 
and  lower  vertical  extensions.  Figure  6.3  shows  an  example. 


A  face  in  T(S)  is  bounded  by  a  number  of  edges  of  T(.S').  Some  of  drese 
edges  may  be  adjacent  and  collinear.  We  call  the  union  of  such  edges  a  side  of 
the  face.  In  other  words,  the  sides  of  a  face  are  the  segments  of  maximal  length 
that  are  contained  in  the  boundary  of  a  face. 

Lemma  6.1  Each  face  in  a  trapezoidal  map  of  a  set  S  of  line  segments  in  general 
position  has  one  or  two  vertical  sides  and  exactly  two  non-vertical  sides. 

Proof.  Let  /  be  a  face  in  “7(5).  We  first  prove  that  /  is  convex. 

Because  the  segments  in  5  are  non-crossing,  any  corner  of  /  is  either  an 
endpoint  of  a  segment  in  5,  a  point  where  a  vertical  extension  abuts  a  segment 
of  S  or  an  edge  of  R,  or  it  is  a  corner  of  R.  Due  to  the  vertical  extensions,  no 
corner  that  is  a  segment  endpoint  can  have  an  interior  angle  greater  than  180°. 
Moreover,  any  angle  at  a  point  where  a  vertical  extension  abuts  a  segment  must 
be  less  than  or  equal  to  180°  as  well.  Finally,  the  corners  of  R  are  90°.  Hence, 
/  is  convex — the  vertical  extensions  have  removed  all  non-convexities. 

Because  we  are  looking  at  sides  of  /,  rather  than  at  edges  of  '.T (S)  on  the 
boundary  of  /,  the  convexity  of  /  implies  that  it  can  have  at  most  two  vertical 
sides.  Now  suppose  for  a  contradiction  that  /  has  more  than  two  non-vertical 
sides.  Then  there  must  be  two  such  sides  that  are  adjacent  and  either  both 
bound  /  from  above  or  both  bound  /  from  below.  Because  any  non-vertical 
side  must  be  contained  in  a  segment  of  S  or  in  an  edge  of  R ,  and  the  segments 
are  non-crossing,  the  two  adjacent  sides  must  meet  in  a  segment  endpoint.  But 
then  the  vertical  extensions  for  that  endpoint  prevent  the  two  sides  from  being 
adjacent,  a  contradiction.  Hence,  /  has  at  most  two  non-vertical  sides. 

Finally,  we  observe  that  /  is  bounded  (since  we  have  enclosed  the  whole 
scene  in  a  bounding  box  R),  which  implies  that  it  cannot  have  less  than  two 
non-vertical  sides  and  that  it  must  have  at  least  one  vertical  side.  0 
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top(  A) 


bottom(  A) 


Lemma  6.1  shows  that  the  trapezoidal  map  deserves  its  name:  each  face 
is  either  a  trapezoid  or  a  triangle,  which  we  can  view  as  a  trapezoid  with  one 
degenerate  edge  of  length  zero. 

In  the  proof  of  Lemma  6. 1  we  observed  that  a  non-vertical  side  of  a  trapezoid 
is  contained  in  a  segment  of  S  or  in  a  horizontal  edge  of  R.  We  denote  the  non¬ 
vertical  segment  of  S,  or  edge  of  R,  bounding  a  trapezoid  A  from  above  by 
top( A),  and  the  one  bounding  it  from  below  by  bottom(A). 

By  the  general  position  assumption,  a  vertical  side  of  a  trapezoid  either 
consists  of  vertical  extensions,  or  it  is  the  vertical  edge  of  R.  More  precisely, 
we  can  distinguish  five  different  cases  for  the  left  side  and  the  right  side  of  a 
trapezoid  A.  The  cases  for  the  left  side  are  as  follows: 

(a)  It  degenerates  to  a  point,  which  is  the  common  left  endpoint  of  top  (A)  and 
bottom(A'). 

(b)  It  is  the  lower  vertical  extension  of  the  left  endpoint  of  top  (A)  that  abuts 
on  bottom(A). 

(c)  It  is  the  upper  vertical  extension  of  the  left  endpoint  of  bottom^ A)  that 
abuts  on  top( A). 

fd)  It  consists  of  the  upper  and  lower  extension  of  the  right  endpoint  p  of  a  third 
segment  s.  These  extensions  abut  on  top  (A)  and  bottom^  A),  respectively, 
(e)  It  is  the  left  edge  of  R.  This  case  occurs  for  a  single  trapezoid  of  T(S)  only, 
namely  the  unique  leftmost  trapezoid  of  T(S). 

The  first  four  cases  are  illustrated  in  Figure  6.4.  The  five  cases  for  the  right 
vertical  edge  of  A  are  symmetrical.  You  should  verify  for  yourself  that  the 
listing  above  is  indeed  exhaustive. 


leftp(A) 

^  leftp(A) 

^  top  (A) 

- ,0^^top(^)00^ 

'bottom^  A) 

9<^t>ottom  (A) 

bottom  (A) 

f 

Figure  6.4 

Four  of  the  five  cases  for  the  left  edge 

~~ — - 

V —  leftp(  A) 

of  trapezoid  A 

(a) 

(b) 

(c) 

leftp(A) 


(d) 


For  every  trapezoid  A  £  T(S),  except  the  leftmost  one,  the  left  vertical  edge 
of  A  is,  in  a  sense,  defined  by  a  segment  endpoint  p:  it  is  either  contained  in 
the  vertical  extensions  of  p,  or — when  it  is  degenerate — it  is  p  itself.  We  will 
denote  the  endpoint  defining  the  left  edge  of  A  by  leftp(A).  As  shown  above, 
leftp(A)  is  the  left  endpoint  of  top( A)  or  bottom^ A),  or  it  is  the  right  endpoint 
of  a  third  segment.  For  the  unique  trapezoid  whose  left  side  is  the  left  edge  of 
R,  we  define  leftp(A)  to  be  the  lower  left  vertex  of  R.  Similarly,  we  denote  the 
endpoint  that  defines  the  right  vertical  edge  of  A  by  rightp(A).  Notice  that  A  is 
uniquely  determined  by  top( A),  bottom(A),  leftp( A),  and  rightp(A).  Therefore 
we  will  sometimes  say  that  A  is  defined  by  these  segments  and  endpoints. 
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The  trapezoidal  map  of  the  edges  of  a  subdivision  is  a  refinement  of  that 
subdivision.  It  is  not  clear,  however,  why  point  location  in  a  trapezoidal  map 
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MAPS 

Lemma  6.2  The  trapezoidal  map  T(S')  of  a  set  S  of  n  line  segments  in  general 
position  contains  at  most  6 n  +  4  vertices  and  at  most  3 n  +  1  trapezoids. 

Proof.  A  vertex  of  T(5)  is  either  a  vertex  of  R.  an  endpoint  of  a  segment  in  S, 
or  else  the  point  where  the  vertical  extension  starting  in  an  endpoint  abuts  on 
another  segment  or  on  the  boundary  of  R.  Since  every  endpoint  of  a  segment 
induces  two  vertical  extensions — one  upwards,  one  downwards — this  implies 
that  the  total  number  of  vertices  is  bounded  by  4  +  2 n  +  2(2 n)  =  6 n  +  4. 

A  bound  on  the  number  of  trapezoids  follows  from  Euler’s  formula  and  the 
bound  on  the  number  of  vertices.  Here  we  give  a  direct  proof,  using  the  point 
leftp( A).  Recall  that  each  trapezoid  has  such  a  point  leftp(A).  This  point  is  the 
endpoint  of  one  of  the  n  segments,  or  is  the  lower  left  corner  of  R.  By  looking 
at  the  five  cases  for  the  left  side  of  a  trapezoid,  we  find  that  the  lower  left  corner 
of  R  plays  this  role  for  exactly  one  trapezoid,  a  right  endpoint  of  a  segment  can 
play  this  role  for  at  most  one  trapezoid,  and  a  left  endpoint  of  a  segment  can  be 
the  leftp(A)  of  at  most  two  different  trapezoids.  (Since  endpoints  can  coincide, 
a  point  in  the  plane  can  be  leftp(A)  for  many  trapezoids.  However,  if  in  case  (a) 
we  consider  leftp(A)  to  be  the  left  endpoint  of  bottom^ A),  then  the  left  endpoint 
of  a  segment  ,v  can  be  leftp(A)  for  only  two  trapezoids,  one  above  ,v  and  one 
below  s.)  It  follows  that  the  total  number  of  trapezoids  is  at  most  3n+  1.  0 

We  call  two  trapezoids  A  and  A'  adjacent  if  they  meet  along  a  vertical  edge. 

In  Figure  6.5(i),  for  example,  trapezoid  A  is  adjacent  to  A] ,  A2,  and  A3,  but  not 
to  A4  and  A5.  Because  the  set  of  line  segments  is  in  general  position,  a  trapezoid 
has  at  most  four  adjacent  trapezoids.  If  the  set  is  not  in  general  position,  a 
trapezoid  can  have  an  arbitrary  number  of  adjacent  trapezoids,  as  illustrated  in 
Figure  6.5(ii).  Let  A'  be  a  trapezoid  that  is  adjacent  to  A  along  the  left  vertical 


should  be  any  easier  than  point  location  in  a  general  subdivision.  But  before 
we  come  to  this  in  the  next  section,  let’s  first  verify  that  the  complexity  of  the 
trapezoidal  map  is  not  too  much  larger  than  the  number  of  segments  in  the  set 
defining  it. 


Figure  6.5 

Trapezoids  adjacent  to  A  are  shaded 


edge  of  A.  Then  either  top( A)  =  top(A')  or  bottom(A)  =  bottom(A').  In  the  first 
case  we  call  A'  the  upper  left  neighbor  of  A,  and  in  the  second  case  A'  is  the 
lower  left  neighbor  of  A.  So  the  trapezoid  in  Figure  6.4(b)  has  a  bottom  left 
neighbor  but  no  top  left  neighbor,  the  trapezoid  in  Figure  6.4(c)  has  a  top  left 
neighbor  but  no  bottom  left  neighbor,  and  the  trapezoid  in  Figure  6.4(d)  has  both 
a  top  left  neighbor  and  a  bottom  left  neighbor.  The  trapezoid  in  Figure  6.4(a) 
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Chapter  6  and  the  single  trapezoid  whose  left  vertical  edge  is  the  left  side  of  R  have  no  left 

point  location  neighbors.  The  upper  right  neighbor  and  lower  right  neighbor  of  a  trapezoid 

are  defined  similarly. 

To  represent  a  trapezoidal  map,  we  could  use  the  doubly-connected  edge 
list  described  in  Chapter  2;  after  all,  a  trapezoidal  map  is  a  planar  subdivision. 
However,  the  special  shape  of  the  trapezoidal  map  makes  it  more  convenient 
to  use  a  specialized  structure.  This  structure  uses  the  adjacency  of  trapezoids 
to  link  the  subdivision  as  a  whole.  There  are  records  for  all  line  segments  and 
endpoints  of  5,  since  they  serve  as  leftp( A),  rightp(A),  top(A),  and  bottom(A). 
Furthermore,  the  structure  contains  records  for  the  trapezoids  of  T(S),  but  not 
for  edges  or  vertices  of  7(S).  The  record  for  a  trapezoid  A  stores  pointers  to 
top( A)  and  bottom^ A),  pointers  to  leftp(A)  and  rightp(A),  and  finally,  pointers 
to  its  at  most  four  neighbors.  Note  that  the  geometry  of  a  trapezoid  A  (that  is, 
the  coordinates  of  its  vertices)  is  not  available  explicitly.  However,  A  is  uniquely 
defined  by  top( A),  bottom( A),  leftp(A),  and  rightp(A).  This  means  that  we  can 
deduce  the  geometry  of  A  in  constant  time  from  the  information  stored  for  A. 


6.2  A  Randomized  Incremental  Algorithm 

In  this  section  we  will  develop  a  randomized  incremental  algorithm  that  con¬ 
structs  the  trapezoidal  map  T(S)  of  a  set  S  of  n  line  segments  in  general  position. 
During  the  construction  of  the  trapezoidal  map,  the  algorithm  also  builds  a  data 
structure  ©  that  can  be  used  to  perform  point  location  queries  in  T(S).  This  is 
the  reason  why  a  plane  sweep  algorithm  isn’t  chosen  to  construct  the  trapezoidal 
map.  It  would  construct  it  all  right,  but  it  wouldn’t  give  us  a  data  structure  that 
supports  point  location  queries,  which  is  the  main  objective  of  this  chapter. 

Before  discussing  the  algorithm,  we  first  describe  the  point  location  data  struc¬ 
ture  ©  that  the  algorithm  constructs.  This  structure,  which  we  call  the  search 
structure,  is  a  directed  acyclic  graph  with  a  single  root  and  exactly  one  leaf  for 
every  trapezoid  of  the  trapezoidal  map  of  S.  Its  inner  nodes  have  out-degree  2. 
There  are  two  types  of  inner  nodes:  x-nodes,  which  are  labeled  with  an  endpoint 
of  some  segment  in  5,  and  y-nodes,  which  are  labeled  with  a  segment  itself. 

A  query  with  a  point  q  starts  at  the  root  and  proceeds  along  a  directed  path 
to  one  of  the  leaves.  This  leaf  corresponds  to  the  trapezoid  A  £  T(S)  containing 
q.  At  each  node  on  the  path,  q  has  to  be  tested  to  determine  in  which  of  the  two 
child  nodes  to  proceed.  At  an  x-node,  the  test  is  of  the  form:  “Does  q  lie  to  the 
left  or  to  the  right  of  the  vertical  line  through  the  endpoint  stored  at  this  node?” 
At  a  y-node,  the  test  has  the  form:  “Does  q  lie  above  or  below  the  segment  s 
stored  here?”  We  will  ensure  that  whenever  we  come  to  a  y-node,  the  vertical 
line  through  q  intersects  the  segment  of  that  node,  so  that  the  test  makes  sense. 
The  tests  at  the  inner  nodes  only  have  two  outcomes:  left  or  right  of  an  endpoint 
for  an  x-node,  and  above  or  below  a  segment  for  a  y-node.  What  should  we 
do  if  the  query  point  lies  exactly  on  the  vertical  line,  or  on  the  segment?  For 
now,  we  shall  simply  make  the  assumption  that  this  does  not  occur;  Section  6.3, 
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which  shows  how  to  deal  with  sets  of  segments  that  are  not  in  general  position, 
will  also  deal  with  this  type  of  query  point. 

The  search  structure  CD  and  the  trapezoidal  map  T (S)  computed  by  the 
algorithm  are  interlinked:  a  trapezoid  A  £  Tf.S'j  has  a  pointer  to  the  leaf  of  CD 
corresponding  to  it,  and  a  leaf  node  of  CD  has  a  pointer  to  the  corresponding 
trapezoid  in  TfSj.  Figure  6.6  shows  the  trapezoidal  map  of  a  set  of  two  line 


Figure  6.6 

The  trapezoidal  map  of  two  segments 
and  a  search  structure 

segments  si  and  S2,  and  a  search  structure  for  the  trapezoidal  map.  The  .\-nodes 
are  white,  with  the  endpoint  labeling  it  inside.  The  y- nodes  are  grey  and  have 
the  segment  labeling  it  inside.  The  leaves  of  the  search  structure  are  shown  as 
squares,  and  are  labeled  with  the  corresponding  trapezoid  in  the  trapezoidal 
map. 

The  algorithm  we  give  for  the  construction  of  the  search  structure  is  incremental: 
it  adds  the  segments  one  at  a  time,  and  after  each  addition  it  updates  the  search 
structure  and  the  trapezoidal  map.  The  order  in  which  the  segments  are  added 
influences  the  search  structure;  some  orders  lead  to  a  search  structure  with  a 
good  query  time,  while  for  others  the  query  time  will  be  bad.  Instead  of  trying 
to  be  clever  in  finding  a  suitable  order,  we  shall  take  the  same  approach  we  took 
in  Chapter  4,  where  we  studied  linear  programming:  we  use  a  random  ordering. 

So  the  algorithm  will  be  randomized  incremental.  Later  we  will  prove  that  the 
search  structure  resulting  from  a  randomized  incremental  algorithm  is  expected 
to  be  good.  But  first  we  describe  the  algorithm  in  more  detail.  We  begin  with  its 
global  structure;  the  various  substeps  will  be  explained  after  that. 

Algorithm  TrapezoidalMap(S) 

Input.  A  set  S  of  n  non-crossing  line  segments. 

Output.  The  trapezoidal  map  ‘T (.S' j  and  a  search  structure  CD  for  Tf.S'j  in  a 
bounding  box. 

1 .  Determine  a  bounding  box  R  that  contains  all  segments  of  .S,  and  initialize 
the  trapezoidal  map  structure  T  and  search  structure  CD  for  it. 

2.  Compute  a  random  permutation  S\,S2,- ■ ■  ,sn  of  the  elements  of  S. 

3.  for  i  <—  1  to  n 

4.  do  Find  the  set  Ao,  Ai, . . .  ,A*  of  trapezoids  in  T  properly  intersected 

by  Si. 

5.  Remove  Ao,  Ai, . . . ,  A^  from  T  and  replace  them  by  the  new  trape¬ 
zoids  that  appear  because  of  the  insertion  of 
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Remove  the  leaves  for  Ao,  Ai . . . . .  A/;  from  CD,  and  create  leaves  for 
the  new  trapezoids.  Link  the  new  leaves  to  the  existing  inner  nodes 
by  adding  some  new  inner  nodes,  as  explained  below. 
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We  now  describe  the  various  steps  of  the  algorithm  in  more  detail.  In  the  follow¬ 
ing,  we  let  Si  :=  { a i , ^2 ,  •  •  The  loop  invariant  of  TrapezoidalMap  is 
that  T  is  the  trapezoidal  map  for  .S-,,  and  that  CD  is  a  valid  search  structure  for  CT. 

The  initialization  of  T  as  CTf.S'o)  =  T(0)  and  of  CD  in  line  1  is  easy:  the 
trapezoidal  map  for  the  empty  set  consists  of  a  single  trapezoid — the  bounding 
rectangle  R — and  the  search  structure  for  T (0)  consists  of  a  single  leaf  node 
for  this  trapezoid.  For  the  computation  of  the  random  permutation  in  line  2  see 
Chapter  4.  Now  let’s  see  how  to  insert  a  segment  ,v(-  in  lines  4—6. 

To  modify  the  current  trapezoidal  map.  we  first  have  to  know  where  it 
changes.  This  is  exactly  at  the  trapezoids  that  are  intersected  by  .v,-.  Stated 
more  precisely,  a  trapezoid  of  T(S,_i)  is  not  present  in  T (Sf)  if  and  only  if  it  is 
intersected  by  s Our  first  task  is  therefore  to  find  the  intersected  trapezoids. 
Let  Ao,Ai, . . .  ,Afc  denote  these  trapezoids,  ordered  from  left  to  right  along  ,v,-. 
Observe  that  AJ+i  must  be  one  of  the  right  neighbors  of  A;.  It  is  also  easy 
to  test  which  neighbor  it  is:  if  rightp(Aj)  lies  above  then  A/+]  is  the  lower 
right  neighbor  of  Ay,  otherwise  it  is  the  upper  right  neighbor.  This  means  that 
once  we  know  Ao,  we  can  find  Ai, . . . , A^  by  traversing  the  representation  of 
the  trapezoidal  map.  So,  to  get  started,  we  need  to  find  the  trapezoid  Ao  G  T 
containing  the  left  endpoint,  p,  of  s,.  If  p  is  not  yet  present  in  Si-  \  as  an 
endpoint,  then,  because  of  our  general  position  assumption,  it  must  lie  in  the 
interior  of  Ao.  This  means  we  can  find  Ao  by  a  point  location  with  p  in 
And  now  comes  the  exciting  part:  at  this  stage  in  the  algorithm  CD  is  a  search 
structure  for  T  =  T(5,-_i),  so  all  that  we  need  to  do  is  to  perform  a  query  on  CD 
with  the  point  p. 

If  p  is  already  an  endpoint  of  a  segment  in  S,  \ — remember  that  we  allow 
different  segments  to  share  an  endpoint — then  we  must  be  careful.  To  find  Ao, 
we  simply  start  to  search  in  CD.  If  p  happens  not  to  be  present  yet,  then  the  query 
algorithm  will  proceed  without  trouble,  and  end  up  in  the  leaf  corresponding 
to  Ao.  If,  however,  p  is  already  present,  then  the  following  will  happen:  at  some 
point  during  the  search,  p  will  lie  on  the  vertical  line  through  the  point  in  an 
x-node.  Recall  that  we  decided  that  such  query  points  are  illegal.  To  remedy 
this,  we  should  imagine  continuing  the  query  with  a  point  p'  slightly  to  the 
right  of  p.  Replacing  p  by  p'  is  only  done  conceptually.  What  it  actually  means 
when  implementing  the  search  is  this:  whenever  p  lies  on  the  vertical  line  of 
an  x-node,  we  decide  that  it  lies  to  the  right.  Similarly,  whenever  p  lies  on  a 
segment  s  of  a  y-node  (this  can  only  happen  if  s,-  shares  its  left  endpoint,  p ,  with 
s)  we  compare  the  slopes  of  s  and  sp,  if  the  slope  of is  larger,  we  decide  that  p 
lies  above  s ,  otherwise  we  decide  that  it  is  below  ,v.  With  this  adaptation,  the 
search  will  end  in  the  first  trapezoid  Ao  intersected  properly  by  .v,-.  In  summary, 
we  use  the  following  algorithm  to  find  Ao, . . . ,  A*. 

Algorithm  FollowSegment(T,  CD,  s,-) 

Input.  A  trapezoidal  map  T,  a  search  structure  CD  for  CT,  and  a  new  segment  sp 
Output.  The  sequence  Aq,  . . . ,  A^  of  trapezoids  intersected  by  ,v,-. 


1.  Let  p  and  q  be  the  left  and  right  endpoint  of  s,. 

2.  Search  with  p  in  the  search  structure  CD  to  find  Ao. 

3.  j  <—  0; 

4.  while  q  lies  to  the  right  of  rightp(Aj) 

5.  do  if  rightp(Aj)  lies  above  Sj 

6.  then  Let  A;+i  be  the  lower  right  neighbor  of  Ay. 

7.  else  Let  Ay+i  be  the  upper  right  neighbor  of  A  j. 

8-  7  —  7+1 

9.  return  Ao,A1; A, 

We  have  seen  how  to  find  the  trapezoids  intersecting  ,v,-.  The  next  step  is  to 
update  T  and  CD.  Let’s  start  with  the  simple  case  that  Sj  is  completely  contained 
in  a  trapezoid  A  =  Ao.  We  are  in  the  situation  depicted  on  the  left  hand  side  of 
Figure  6.7. 


To  update  T,  we  delete  A  from  T,  and  replace  it  by  four  new  trapezoids  A,  B, 
C,  and  D.  Notice  that  all  the  information  we  need  to  initialize  the  records  for  the 
new  trapezoids  correctly  (their  neighbors,  the  segments  on  the  top  and  bottom, 
and  the  points  defining  their  left  and  right  vertical  edges)  are  available:  they  can 
be  determined  in  constant  time  from  the  segment  ,v,  and  the  information  stored 
for  A. 

It  remains  to  update  CD.  What  we  must  do  is  replace  the  leaf  for  A  by  a  little 
tree  with  four  leaves.  The  tree  contains  two  x-nodes,  testing  with  the  left  and 
right  endpoint  of  .v/,  and  one  y-node,  testing  with  the  segment  ,sy  itself.  This 
is  sufficient  to  determine  in  which  of  the  four  new  trapezoids  A,  B,  C,  or  D  a 
query  point  lies,  if  we  already  know  that  it  lies  in  A.  The  right  hand  side  of 
Figure  6.7  illustrates  the  modifications  to  the  search  structure.  Note  that  one  or 
both  endpoints  of  the  segment  Sj  could  be  equal  to  leftp(A)  or  rightp(A).  In  that 
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Figure  6. 7 

The  new  segment  Sj  lies  completely  in 
trapezoid  A 
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Chapter  6  case  there  would  be  only  two  or  three  new  trapezoids,  but  the  modification  is 
point  location  done  in  the  same  spirit. 

The  case  where  Sj  intersects  two  or  more  trapezoids  is  only  slightly  more 
complicated.  Let  Ao,  A],  ....  A*  be  the  sequence  of  intersected  trapezoids. 
To  update  7,  we  first  erect  vertical  extensions  through  the  endpoints  of  ,v,-, 


Figure  6.8 

Segment  s,  intersects  four  trapezoids 


partitioning  Ao  and  A*  into  three  new  trapezoids  each.  This  is  only  necessary  for 
the  endpoints  of  ,v,-  that  were  not  already  present.  Then  we  shorten  the  vertical 
extensions  that  now  abut  on  s,-.  This  amounts  to  merging  trapezoids  along  the 
segment  s see  Figure  6.8.  Using  the  information  stored  with  the  trapezoids 
Ao,  Ai,  ....  A*,  this  step  can  be  done  in  time  that  is  linear  in  the  number  of 
intersected  trapezoids. 

To  update  D,  we  have  to  remove  the  leaves  for  Ao,  A],  ....  A/.,  we  must 
create  leaves  for  the  new  trapezoids,  and  we  must  introduce  extra  inner  nodes. 
More  precisely,  we  proceed  as  follows.  If  Ao  has  the  left  endpoint  of  s,-  in  its 
interior  (which  means  it  has  been  partitioned  into  three  new  trapezoids)  then  we 
replace  the  leaf  for  Ao  with  an  x-node  for  the  left  endpoint  of  .v,  and  a  y-node 
for  the  segment  s,-.  Similarly,  if  A*  has  the  right  endpoint  of  Sj  in  its  interior,  we 
replace  the  leaf  for  A*  with  an  x-node  for  the  right  endpoint  of  ,v,  and  a  y-node 
for  Si.  Finally,  the  leaves  of  Ai  to  A*_i  are  replaced  with  single  y-nodes  for  the 
segment  s,  .  We  make  the  outgoing  edges  of  the  new  inner  nodes  point  to  the 
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correct  new  leaves.  Notice  that,  due  to  the  fact  that  we  have  merged  trapezoids  Section  6.2 

stemming  from  different  trapezoids  of  7,  there  can  be  several  incoming  edges  a  randomized  incremental 

for  a  new  trapezoid.  Figure  6.8  illustrates  this.  algorithm 

We  have  finished  the  description  of  the  algorithm  that  constructs  T(s)  and. 
at  the  same  time,  builds  a  search  structure  D  for  it.  The  correctness  of  the 
algorithm  follows  directly  from  the  loop  invariant  (stated  just  after  Algorithm 
TrapezoidalMap),  so  it  remains  to  analyze  its  performance. 

The  order  in  which  we  treat  the  segments  has  considerable  influence  on  the 
resulting  search  structure  D  and  on  the  running  time  of  the  algorithm  itself.  For 
some  cases  the  resulting  search  structure  has  quadratic  size  and  linear  search 
time,  but  for  other  permutations  of  the  same  set  of  segments  the  resulting  search 
structure  is  much  better.  As  in  Chapter  4,  we  haven’t  tried  to  determine  a 
good  sequence;  we  simply  took  a  random  insertion  sequence.  This  means  that 
the  analysis  will  be  probabilistic:  we  shall  look  at  the  expected  performance 
of  the  algorithm  and  of  the  search  structure.  Perhaps  it’s  not  quite  clear  yet 
what  the  term  “expected”  means  in  this  context.  Consider  a  fixed  set  S  of  n 
non-crossing  line  segments.  TrapezoidalMap  computes  a  search  structure 
CD  for  7(S).  This  structure  depends  on  the  permutation  chosen  in  line  2.  Since 
there  are  n\  possible  permutations  of  n  objects,  there  are  n\  possible  ways  in 
which  the  algorithm  can  proceed.  The  expected  running  time  of  the  algorithm 
is  the  average  of  the  running  time,  taken  over  all  n\  permutations.  For  each 
permutation,  a  different  search  structure  will  result.  The  expected  size  of  CD  is 
the  average  of  the  sizes  of  all  these  n!  search  structures.  Finally,  the  expected 
query  time  for  a  point  q  is  the  average  of  the  query  time  for  point  q,  over  all  n ! 
search  structures.  (Notice  that  this  not  the  same  as  the  average  of  the  maximum 
query  time  for  the  search  structure.  Proving  a  bound  on  this  quantity  is  a  bit 
more  technical,  so  we  defer  that  to  Section  6.4.) 

Theorem  6.3  Algorithm  TrapezoidalMap  computes  the  trapezoidal  map 
7(S)  of  a  set  S  of  n  line  segments  in  general  position  and  a  search  structure  CD 
for7(S)  in  0(n  log/;)  expected  time.  The  expected  size  of  the  search  structure 
is  0(n)  and  for  any  query  point  q  the  expected  query  time  is  O(logn). 

Proof.  As  noted  earlier,  the  correctness  of  the  algorithm  follows  directly  from 
the  loop  invariant,  so  we  concentrate  on  the  performance  analysis. 

We  start  with  the  query  time  of  the  search  structure  CD.  Let  q  be  a  fixed 
query  point.  As  the  query  time  for  q  is  linear  in  the  length  of  the  path  in  CD  that 
is  traversed  when  querying  with  q ,  it  suffices  to  bound  the  path  length.  A  simple 
case  analysis  shows  that  the  depth  of  CD  (that  is,  the  maximum  path  length) 
increases  by  at  most  3  in  every  iteration  of  the  algorithm.  Hence,  3 n  is  an  upper 
bound  on  the  query  time  for  q.  This  bound  is  the  best  possible  worst-case  bound 
over  all  possible  insertion  orders  for  S.  We  are  not  so  much  interested  in  the 
worst-case  case  behavior,  however,  but  in  the  expected  behavior:  we  want  to 
bound  the  average  query  time  for  q  with  respect  to  the  n !  possible  insertion 
orders. 

Consider  the  path  traversed  by  the  query  for  q  in  D.  Every  node  on  this  path 
was  created  at  some  iteration  of  the  algorithm.  Let  A,,  for  \  f  i  f  n,  denote 
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Chapter  6  the  number  of  nodes  on  the  path  created  in  iteration  i.  Since  we  consider  S  and 

point  location  q  to  be  fixed,  X,  is  a  random  variable — it  depends  on  the  random  order  of  the 

segments  only.  We  can  now  express  the  expected  path  length  as 

E[£*]  =  £  Eft-]. 

i=  1  i=  1 

The  equality  here  is  linearity  of  expectation-,  the  expected  value  of  a  sum  is 
equal  to  the  sum  of  the  expected  values. 

We  already  observed  that  any  iteration  adds  at  most  three  nodes  to  the  search 
path  for  any  possible  query  point,  so  Xj  f  3.  In  other  words,  if  Pt  denotes  the 
probability  that  there  exists  a  node  on  the  search  path  of  q  that  is  created  in 
iteration  i,  we  have 

E[Xi]  f  3 Pi. 

The  central  observation  to  derive  a  bound  on  Pj  is  this:  iteration  i  contributes  a 
node  to  the  search  path  of  q  exactly  if  Ar/  (.S',  i ) ,  the  trapezoid  containing  q  in 
T(5,_i),  is  not  the  same  as  A q(Si),  the  trapezoid  containing  q  in  T(5,-).  In  other 
words, 

Pi  =  Pr[A9(5,-)  7^  A?(5,;_1)]. 

If  A q(Sj)  is  not  the  same  as  Af/(S;_i),  then  A q(S/)  must  be  one  of  the  trapezoids 
created  in  iteration  i.  Note  that  all  trapezoids  A  created  in  iteration  i  are  adjacent 
to  the  segment  that  is  inserted  in  that  iteration:  either  top( A)  or  bottom(A)  is 
Si,  or  leftp(A)  or  rightp(A)  is  an  endpoint  of  s,-. 

Now  consider  a  fixed  set  Si  C  S.  The  trapezoidal  map  Tf.Sjj,  and  therefore 
A q(Si),  are  uniquely  defined  as  a  function  of  S, ;  A q(S/)  does  not  depend  on  the 
order  in  which  the  segments  in  .S’,  have  been  inserted.  To  bound  the  probability 
that  the  trapezoid  containing  q  has  changed  due  to  the  insertion  of  ,q,  we  shall 
use  a  trick  we  also  used  in  Chapter  4,  called  backwards  analysis:  we  consider 
T (Si)  and  look  at  the  probability  that  A q(Sj)  disappears  from  the  trapezoidal  map 
when  we  remove  the  segment  s,-.  By  what  we  said  above,  A 9(S,-)  disappears  if 
and  only  if  one  of  top{Aq{S/)),  bottom(Aq(Si)),  leftp(Aq(Sj)),  or  rightp(Aq(Si )) 
disappears  with  the  removal  of  s,-.  What  is  the  probability  that  top(Aq(Sj)) 
disappears?  The  segments  of  S,  have  been  inserted  in  random  order,  so  every 
segment  in  Si  is  equally  likely  to  be  ,v,.  This  means  that  the  probability  that  ,v, 
happens  to  be  top(Aq(S/))  is  1  /;.  (If  top(Aq(Si))  is  the  top  edge  of  the  rectangle 
R  surrounding  the  scene,  then  the  probability  is  even  zero.)  Similarly,  the 
probability  that happens  to  be  bottom(Aq(Si ))  is  at  most  l/i.  There  can  be 
many  segments  sharing  the  point  Ieftp(Aq(Si)).  Hence,  the  probability  that  ,v, 
is  one  of  these  segments  can  be  large.  But  leftp(Aq(Si))  disappears  only  if.?,-  is 
the  only  segment  in  Sj  with  leftp(Aq(Si))  as  an  endpoint.  Hence,  the  probability 
that  leftp(Aq(Si))  disappears  is  at  most  l/i  as  well.  The  same  holds  true  for 
rightp(Aq(Sj)).  Hence,  we  can  conclude  that 

Pi  =  Pr[A,(Si)  ±  A ,($_!)]  =  Pr[A ,(S,-)  g  7(Sf_t)]  <  4/i. 

_  (A  small  technical  point:  In  the  argument  above  we  fixed  the  set  Sj.  This  means 

134  that  the  bound  we  derived  on  Pj  holds  under  the  condition  that  .S-,  is  this  fixed 


set.  But  since  the  bound  does  not  depend  on  what  the  fixed  set  actually  is,  the  Section  6.2 
bound  holds  unconditionally.)  a  randomized  incremental 

Putting  it  all  together  we  get  the  bound  on  the  expected  query  time:  algorithm 


n  n  n  i  ^  n  i 

E[£X/]  <  £  3P'  <  E  —  =  12  E  -  =  UHn- 

i—  1  i=  1  i= 1  1  i=  I  * 


Here,  //„  is  the  u-th  harmonic  number ,  defined  as 


fl»:= 


1  1 
2  +  3 


1 

n 


Harmonic  numbers  arise  quite  often  in  the  analysis  of  algorithms,  so  it’s  good 
to  remember  the  following  bound  that  holds  for  all  n  >  1: 


Inn  <  Hn  <  In n  +  1. 

(It  can  be  derived  by  comparing  //„  to  the  integral  f"  1/x  dx  =  Inn.)  We 
conclude  that  the  expected  query  time  for  a  query  point  q  is  (9(logn),  as  claimed. 

Let’s  now  turn  to  the  size  of  CD.  To  bound  the  size,  it  suffices  to  bound  the 
number  of  nodes  in  CD.  We  first  note  that  the  leaves  in  CD  are  in  one-to-one 
correspondence  with  the  trapezoids  in  A,  of  which  there  are  0(n)  by  Lemma  6.2. 
This  means  that  the  total  number  of  nodes  is  bounded  by 

n 

0(n)  +  ^(number  of  inner  nodes  created  in  iteration  i). 
i=i 

Let  kj  be  the  number  of  new  trapezoids  that  are  created  in  iteration  i,  due  to  the 
insertion  of  segment  s,-.  In  other  words,  kj  is  the  number  of  new  leaves  in  CD. 
The  number  of  inner  nodes  created  in  iteration  i  is  exactly  equal  to  kj  —  1 .  A 
simple  worst  case  upper  bound  on  kj  follows  from  the  fact  that  the  number  of 
new  trapezoids  in  7 (Si)  can  obviously  not  be  larger  than  the  total  number  of 
trapezoids  in  T (Si),  which  is  0(i).  This  leads  to  a  worst-case  upper  bound  on 
the  size  of  the  structure  of 

0(n)  +  Yj0(i)  =  0(n2). 
i=  1 

Indeed,  if  we  have  bad  luck  and  the  order  in  which  we  insert  the  segments  is 
very  unfortunate,  then  the  size  of  CD  can  be  quadratic.  However,  we  are  more 
interested  in  the  expected  size  of  the  data  structure,  over  all  possible  insertion 
orders.  Using  linearity  of  expectation,  we  find  that  this  expected  size  is  bounded 
by 

0(n)+E[£(*,-l)]=0(n)  +  £E[*/]. 

i=  1  i=  1 

It  remains  to  bound  the  expected  value  of  kj.  We  already  prepared  the  necessary 
tools  for  this  when  we  derived  the  bound  on  the  query  time.  Consider  a  fixed 
set  S{  C  S.  For  a  trapezoid  A  £  7 (Si)  and  a  segment  s  £  Sj ,  let 


5(A,s)  : 


1  if  A  disappears  from  7 (Si)  when  ,v  is  removed  from  Sj, 
0  otherwise. 
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Chapter  6  In  the  analysis  of  the  query  time  we  observed  that  there  are  at  most  four  segments 
point  location  that  cause  a  given  trapezoid  to  disappear.  Hence, 

E  E  5(A,,K4|T(5;)|  =  O(0. 

seSi  AeT (s,) 

Now,  kj  is  the  number  of  trapezoids  created  by  the  insertion  of  Sj,  or,  equivalently, 
the  number  of  trapezoids  in  T(.S',)  that  disappear  when  ,v,  is  removed.  Since  ,v, 
is  a  random  element  of  Sj,  we  can  find  the  expected  value  of  kj  by  taking  the 
average  over  all  s  €  Sj~. 


m  =  - 1  E  5(A,J)<^  =  o(i). 

1  seSj  a eT(Sj)  1 


We  conclude  that  the  expected  number  of  newly  created  trapezoids  is  0(1)  in 
every  iteration  of  the  algorithm,  from  which  the  0(n )  bound  on  the  expected 
amount  of  storage  follows. 

It  remains  to  bound  the  expected  running  time  of  the  construction  algorithm. 
Given  the  analysis  of  the  query  time  and  storage,  this  is  easy.  We  only  need  to 
observe  that  the  time  to  insert  segment  s,-  is  0(kj)  plus  the  time  needed  to  locate 
the  left  endpoint  of  Sj  in  T(5,-_i).  Using  the  earlier  derived  bounds  on  kj  and  the 
query  time,  we  immediately  get  that  the  expected  running  time  of  the  algorithm 
is 

0(1)  +  E{°(1osO  +  °(eM)}  =  0(n  logn). 

This  completes  the  proof.  Li 

Note  once  again  that  the  expectancy  in  Theorem  6.3  is  solely  with  respect 
to  the  random  choices  made  by  the  algorithm;  we  do  not  average  over  possible 
choices  for  the  input.  Hence,  there  are  no  bad  inputs:  for  any  input  set  of  n  line 
segments,  the  expected  running  time  of  the  algorithm  is  0(n  logn). 

As  discussed  earlier.  Theorem  6.3  does  not  guarantee  anything  about  the 
expected  maximum  query  time  over  all  possible  query  points.  In  Section  6.4. 
however,  it  is  proved  that  the  expected  maximum  query  time  is  G(logn)  as  well. 
Hence,  we  can  build  a  data  structure  of  expected  size  0(n ),  whose  expected 
query  time  is  (9(log«).  This  also  proves  the  existence  of  a  data  structure  with 
0(n)  size  and  O(logn)  query  time  for  any  query  point  q — see  Theorem  6.8. 

Finally  we  go  back  to  our  original  problem:  point  location  in  a  planar  subdivi¬ 
sion  S.  We  assume  that  S  is  given  as  a  doubly-connected  edge  list  with  n  edges. 
We  use  algorithm  TrapezoidalMap  to  compute  a  search  structure  D  for  the 
trapezoidal  map  of  the  edges  of  S.  To  use  this  search  structure  for  point  location 
in  §,  however,  we  still  need  to  attach  to  every  leaf  of  CD  a  pointer  to  the  face  / 
of  S  that  contains  the  trapezoid  of  7(S)  corresponding  to  that  leaf.  But  this 
is  rather  easy:  recall  from  Chapter  2  that  the  doubly-connected  edge  list  of  § 
stores  with  every  half-edge  a  pointer  to  the  face  that  is  incident  to  the  left.  The 
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face  that  is  incident  to  the  right  can  be  found  in  constant  time  from  Twin(e).  So 
for  every  trapezoid  A  of  T(S)  we  simply  look  at  the  face  of  8  incident  to  top(A) 
from  below.  If  top( A)  is  the  top  edge  of  R ,  then  A  is  contained  in  the  unique 
unbounded  face  of  8. 

In  the  next  section  we  show  that  the  assumption  that  the  segments  be  in 
general  position  may  be  dropped,  which  leads  to  a  less  restricted  version  of 
Theorem  6.3.  It  implies  the  following  corollary. 

Corollary  6.4  Let  8  be  a  planar  subdivision  with  n  edges.  In  0(n  log  n)  expected 
time  one  can  construct  a  data  structure  that  uses  0(n)  expected  storage,  such  that 
for  any  query  point  q,  the  expected  time  for  a  point  location  query  is  O(k)gn). 


6.3  Dealing  with  Degenerate  Cases 

In  the  previous  sections  we  made  two  simplifying  assumptions.  First  of  all,  we 
assumed  that  the  set  of  line  segments  was  in  general  position,  meaning  that  no 
two  distinct  endpoints  have  the  same  x-coordinate.  Secondly,  we  assumed  that  a 
query  point  never  lies  on  the  vertical  line  of  an  x-node  on  its  search  path,  nor  on 
the  segment  of  a  v-nodc.  We  now  set  out  to  get  rid  of  these  assumptions. 

We  first  show  how  to  avoid  the  assumption  that  no  two  distinct  endpoints  lie  on 
a  common  vertical  line.  The  crucial  observation  is  that  the  vertical  direction 
chosen  to  define  the  trapezoidal  map  of  the  set  of  line  segments  was  immaterial. 
Therefore  we  can  rotate  the  axis-system  slightly.  If  the  rotation  angle  is  small 
enough,  then  no  two  distinct  endpoints  will  lie  on  the  same  vertical  line  anymore. 
Rotations  by  very  small  angles,  however,  lead  to  numerical  difficulties.  Even  if 
the  input  coordinates  are  integer,  we  need  a  significantly  higher  precision  to  do 
the  calculations  properly.  A  better  approach  is  to  do  the  rotation  symbolically.  In 
Chapter  5  we  have  seen  another  form  of  a  symbolic  transformation,  composite 
numbers,  which  was  used  to  deal  with  the  case  where  data  points  have  the  same 
x-  or  y-coordinate.  In  this  chapter,  we  will  have  another  look  at  such  a  symbolic 
perturbation ,  and  will  try  to  interpret  it  geometrically. 


It  will  be  convenient  not  to  use  rotation,  but  to  use  an  affine  mapping  called 
shear  transformation.  In  particular,  we  shall  use  the  shear  transformation  along 
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Figure  6. 9 

The  shear  transformation 
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Chapter  6  the  x-axis  by  some  value  e  >  0: 

POINT  LOCATION 


x  +  e>A 

.  y  ) 


Figure  6.9  illustrates  the  effect  of  this  shear  transform.  It  maps  a  vertical  line 
to  a  line  with  slope  1  /e,  so  that  any  two  distinct  points  on  the  same  vertical 
line  are  mapped  to  points  with  distinct  x-coordinates.  Furthermore,  if  e  >  0  is 
small  enough,  then  the  transformation  does  not  reverse  the  order  in  x-direction 
of  the  given  input  points.  It  is  not  difficult  to  compute  an  upper  bound  for  e  to 
guarantee  that  this  property  holds.  In  the  following,  we  will  assume  that  we 
have  a  value  of  e  >  0  available  that  is  small  enough  that  the  order  of  points  is 
conserved  by  the  shear  transform.  Surprisingly,  we  will  later  realize  that  we  do 
not  need  to  compute  such  an  actual  value  for  e. 


Given  a  set  S  of  n  arbitrary  non-crossing  line  segments,  we  will  run  algorithm 
TrapezoidalMap  on  the  set  (pS  :=  { (ps :  s  €  5}.  As  noted  earlier,  however, 
actually  performing  the  transformation  may  lead  to  numerical  problems,  and 
therefore  we  use  a  trick:  a  point  (pp  =  (x+  £y,y)  will  simply  be  stored  as  (x,y). 
This  is  a  unique  representation.  We  only  need  to  make  sure  that  the  algorithm 
treats  the  segments  represented  in  this  way  correctly.  Here  it  comes  handy 
that  the  algorithm  does  not  compute  any  geometric  objects:  we  never  actually 
compute  the  coordinates  of  the  endpoints  of  vertical  extensions,  for  instance. 
All  it  does  is  to  apply  two  types  of  elementary  operations  to  the  input  points. 
The  first  operation  takes  two  distinct  points  p  and  q  and  decides  whether  q  lies 
to  the  left,  to  the  right,  or  on  the  vertical  line  through  p.  The  second  operation 
takes  one  of  the  input  segments,  specified  by  its  two  endpoints  p\  and  pi,  and 
tests  whether  a  third  point  q  lies  above,  below,  or  on  this  segment.  This  second 
operation  is  only  applied  when  we  already  know  that  a  vertical  line  through  q 
intersects  the  segment.  All  the  points  p,  q,  p\,  and  p2  are  endpoints  of  segments 
in  the  input  set  S.  (You  may  want  to  go  through  the  description  of  the  algorithm 
again,  verifying  that  it  can  indeed  be  realized  with  these  two  operations  alone.) 

Let’s  look  at  how  to  apply  the  first  operation  to  two  transformed  points 
(pp  and  (pq.  These  points  have  coordinates  (xp  +  £yp,yp)  and  (xq  +  £yq,yq), 
respectively.  If  xq  /  xp,  then  the  relation  between  xq  and  xp  determines  the 
outcome  of  the  test — after  all,  we  had  chosen  e  to  have  this  property.  If  xq  =  xp, 
then  the  relation  between  yq  and  yp  decides  the  horizontal  order  of  the  points. 
Therefore  there  is  a  strict  horizontal  order  for  any  pair  of  distinct  points.  It 
follows  that  (pq  will  never  lie  on  the  vertical  line  through  (pp ,  except  when  p 
and  q  coincide.  But  this  is  exactly  what  we  need,  since  no  two  distinct  input 
points  should  have  the  same  x-coordinate. 

For  the  second  operation,  we  are  given  a  segment  (ps  with  endpoints  (pp\  = 
(xi  +£yi,yi)  and  (pp2  =  (x2  +  ey2,y2),  and  we  want  to  test  whether  a  point  (pq  = 
(x+  £y,y)  lies  above,  below,  or  on  (ps.  The  algorithm  ensures  that  whenever  we 
do  this  test,  the  vertical  line  through  (pq  intersects  (ps.  In  other  words, 
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x\  +£y i  <  x+ £y  ^  X2  +  ey2- 


This  implies  that  x\  ^  x  ^  xi-  Moreover,  if  x  =  x\  then  y  ^  y  i,  and  if  x  =  xj 
then  v  ^  yi-  Let’s  now  distinguish  two  cases. 

If  x\  =  xj,  the  untransformed  segment  s  is  vertical.  Since  now  x\  =  x  =  X2, 
we  have  yi  ^  y  ^  y2,  which  implies  that  q  lies  on  s.  Because  the  affine  mapping 
( p  is  incidence  preserving — if  two  points  coincide  before  the  transformation, 
they  do  so  afterwards — we  can  conclude  that  (pq  lies  on  (ps. 

Now  consider  the  case  where  x\  <  xj.  Since  we  already  know  that  the 
vertical  line  through  (pq  intersects  (ps,  it  is  good  enough  to  do  the  test  with  (ps. 
Now  we  observe  that  the  mapping  (p  preserves  the  relation  between  points  and 
lines:  if  a  point  is  above  (or  on,  or  below)  a  given  line,  then  the  transformed 
point  is  above  (or  on,  or  below)  the  transformed  line.  Hence,  we  can  simply 
perform  the  test  with  the  untransformed  point  q  and  segment  s. 

This  shows  that  to  run  the  algorithm  on  (pS  instead  of  on  .S’,  the  only  mod¬ 
ification  we  have  to  make  is  to  compare  points  lexicographically,  when  we 
want  to  determine  their  horizontal  order.  Of  course,  the  algorithm  will  com¬ 
pute  the  trapezoidal  map  for  (pS ,  and  a  search  structure  for  T ((pS).  Note  that, 
as  promised,  we  never  actually  needed  the  value  of  e,  so  there  is  no  need  to 
compute  such  a  value  at  the  beginning.  All  we  needed  was  that  e  is  small 
enough. 

Using  our  shear  transformation,  we  got  rid  of  the  assumption  that  any  two 
distinct  endpoints  should  have  distinct  x-coordinates.  What  about  the  restriction 
that  a  query  point  never  lies  on  the  vertical  line  of  an  x-node  on  the  search  path, 
nor  on  the  segment  of  a  y-node?  Our  approach  solves  this  problem  as  well,  as 
we  show  next. 

Since  the  constructed  search  structure  is  for  the  transformed  map  T(<pS), 
we  will  also  have  to  use  the  transformed  query  point  (pq  when  doing  a  query. 
In  other  words,  all  the  comparisons  we  have  to  do  during  the  search  must  be 
done  in  the  transformed  space.  We  already  know  how  to  do  the  tests  in  the 
transformed  space: 

At  x-nodes  we  have  to  do  the  test  lexicographically.  As  a  result,  no  two 
distinct  points  will  ever  lie  on  a  vertical  line.  (Trick  question:  How  can  this  be 
true,  if  the  transformation  (p  is  bijective?)  This  does  not  mean  that  the  outcome 
of  the  test  at  an  x-node  is  always  “to  the  right’’  or  “to  the  left’’.  The  outcome 
can  also  be  “on  the  line”.  This  can  only  happen,  however,  when  the  query  point 
coincides  with  the  endpoint  stored  at  the  node — and  this  answers  the  query! 

At  y-nodes  we  have  to  test  the  transformed  query  point  against  a  transformed 
segment.  The  test  described  above  can  have  three  outcomes:  “above”,  “below”, 
or  “on”.  In  the  first  two  cases,  there  is  no  problem,  and  we  can  descend  to  the 
corresponding  child  of  the  y-node.  If  the  outcome  of  the  test  is  “on”,  then  the 
untransformed  point  lies  on  the  untransformed  segment  as  well,  and  we  can 
report  that  fact  as  the  answer  to  the  query. 


Section  6.3 

DEALING  WITH  DEGENERATE  CASES 


We  have  generalized  Theorem  6.3  to  arbitrary  sets  of  non-crossing  line  segments. 

Theorem  6.5  Algorithm  TrapezoidalMap  computes  the  trapezoidal  map 
‘J (S)  of  a  set  S  of  n  non-crossing  line  segments  and  a  search  structure  I)  for 
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Chapter  6 

POINT  LOCATION 


‘.T (.S’ j  in  0(n\ogn)  expected  time.  The  expected  size  of  the  search  structure  is 
0(n )  and  for  any  query  point  q  the  expected  query  time  is  <9(logn). 


6.4*  A  Tail  Estimate 


Theorem  6.5  states  that  for  any  query  point  q,  the  expected  query  time  is 
O(\ogn).  This  is  a  rather  weak  result.  In  fact,  there  is  no  reason  to  expect  that 
the  maximum  query  time  of  the  search  structure  is  small:  it  may  be  that  for  any 
permutation  of  the  set  of  segments  the  resulting  search  structure  has  a  bad  query 
time  for  some  query  point.  In  this  section  we  shall  prove  that  there  is  no  need  to 
worry:  the  probability  that  the  maximum  query  time  is  bad  is  very  small.  To 
this  end  we  first  prove  the  following  high-probability  bound. 

Lemma  6.6  Let  S  be  a  set  of  n  non-crossing  line  segments,  let  q  be  a  query 
point,  and  let  A  be  a  parameter  with  A  >  0.  Then  the  probability  that  the  search 
path  for  q  in  the  search  structure  computed  by  Algorithm  TrapezoidalMap 
has  more  than  3A  In (n  +  1)  nodes  is  at  most  1  /(»  +  1)3.  in  1.25-1. 


{1,2, 3, 4} 


Proof.  We  would  like  to  define  random  variables  X,,  with  1  ^  i  ^  n,  such  that 
X,  is  1  if  at  least  one  node  on  the  search  path  for  q  has  been  created  in  iteration  i 
of  the  algorithm,  and  0  if  no  such  node  has  been  created.  Unfortunately,  the 
random  variables  defined  this  way  are  not  independent.  (We  did  not  need 
independence  in  the  proof  of  Theorem  6.3,  but  we  shall  need  it  now.)  Therefore, 
we  use  a  little  trick. 

We  define  a  directed  acyclic  graph  9,  with  one  source  and  one  sink.  Paths 
in  S  from  the  source  to  the  sink  correspond  to  the  permutations  of  5.  The  graph 
S  is  defined  as  follows.  There  is  a  node  for  every  subset  of  S,  including  one 
for  the  empty  set.  With  a  slight  abuse  of  terminology,  we  shall  often  speak  of 
“the  subset  S'”  when  we  mean  “the  node  corresponding  to  the  subset  S'  ”.  It  is 
convenient  to  imagine  the  nodes  as  being  grouped  into  n  +  1  layers,  such  that 
layer  i  contains  the  subsets  of  cardinality  i.  Notice  that  layers  0  and  n  both 
have  exactly  one  node,  which  correspond  to  the  empty  set  and  to  the  set  S, 
respectively.  A  node  in  layer  i  has  outgoing  arcs  to  some  of  the  nodes  in  layer 
i  +  1 .  More  precisely,  a  subset  S'  of  cardinality  i  has  outgoing  arcs  to  the  subsets 
S"  of  cardinality  i  +  1  if  and  only  if  that  S'  C  S" .  In  other  words,  a  subset  S'  has 
an  outgoing  arc  to  a  subset  S"  if  S"  can  be  obtained  by  adding  one  segment  of 
S  to  S'.  We  label  the  arc  with  this  segment.  Note  that  a  subset  S'  in  layer  i  has 
exactly  i  incoming  arcs,  each  labeled  with  a  segment  in  S',  and  exactly  n  —  i 
outgoing  arcs,  each  labeled  with  a  segment  in  S  \  S'. 

Directed  paths  from  the  source  to  the  sink  in  9  now  correspond  one-to-one 
to  permutations  of  S,  which  correspond  to  possible  executions  of  the  algorithm 
TrapezoidalMap.  Consider  an  arc  of  9  from  subset  S'  in  layer  i  to  subset  S" 
in  layer  i  +  1.  Let  the  segment  ,v  be  the  label  of  the  arc.  The  arc  represents  the 
insertion  of  ,v  into  the  trapezoidal  map  of  S' .  We  mark  this  arc  if  this  insertion 
changes  the  trapezoid  containing  point  q.  To  be  able  to  say  something  about 
the  number  of  marked  arcs,  we  use  the  backwards  analysis  argument  from  the 
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proof  of  Theorem  6.3:  there  are  at  most  four  segments  that  change  the  trapezoid 
containing  q  when  they  are  removed  from  a  subset  S" .  This  means  that  any  node 
in  3  has  at  most  four  marked  incoming  arcs.  But  it  is  possible  that  a  node  has 
less  than  four  marked  incoming  arcs.  In  that  case,  we  simply  mark  some  other, 
arbitrary  incoming  arcs,  so  that  the  number  of  marked  incoming  arcs  is  exactly 
four.  For  nodes  in  the  first  three  layers,  which  have  less  than  four  incoming  arcs, 
we  mark  all  incoming  arcs. 

We  want  to  analyze  the  expected  number  of  steps  during  which  the  trapezoid 
containing  q  changes.  In  other  words,  we  want  to  analyze  the  expected  number 
of  marked  edges  on  a  source-to-sink  path  in  3-  To  this  end  we  define  the  random 
variable  Xj  as  follows: 


X;:= 


if  the  ;-th  arc  on  the  sink-to-source  path  in  3  is  marked, 
otherwise. 


Note  the  similarity  of  the  definition  of  Xj  and  the  definition  of  X,  on  page  134. 
The  i-th  arc  on  the  path  is  from  a  node  in  layer  i  —  1  to  a  node  in  layer  i,  and 
each  such  arc  is  equally  likely  to  be  the  i-th  arc.  Since  each  node  in  layer  i 
has  i  incoming  arcs,  exactly  four  of  which  are  marked  (assuming  i  (s  4),  this 
implies  that  Pr [Xj  =  1]  =  4/i  for  i  ^  4.  For  i  <  4  we  have  Pr[X;  =  1]  =  1  <  4 /i. 
Moreover,  we  note  that  the  Xj  are  independent  (unlike  the  random  variables  Xj 
defined  on  page  134). 

Let  Y  :=  J)"=|  Xj.  The  number  of  nodes  on  the  search  path  for  q  is  at  most 
3T,  and  we  will  bound  the  probability  that  Y  exceeds  A  ln(«  +1).  We  will  use 
Markov’s  inequality ,  which  states  that  for  any  nonnegative  random  variable  Z 
and  any  a  >  0  we  have 


Pr[Z  ^  a]  < 


E[Z] 

a 


So  for  any  f  >  0  we  have 


?r[Y^Un{n  +  l)}=Yr[etY  ^eaXn{-n+x'>]^e~lUn(n+l)E[e,Y}. 


Recall  that  the  expected  value  of  the  sum  of  random  variables  is  the  sum  of 
the  expected  values.  In  general  it  is  not  true  that  the  expected  value  of  a 
product  is  the  product  of  the  expected  values.  But  for  random  variables  that  are 
independent  it  is  true.  Our  X,  are  independent,  so  we  have 


E[e,Y ]  =  E[e^,x-}  =  E[f]e,Z;]  =  ]“[E[< 


tX  1 


If  we  choose  t  =  In  1 .25,  we  get 

E[etX‘]  <  =  (l  +  l/4)-  +  l  — -  =  1+-  =  — , 

l  l  l  l  l  l 


and  we  have 


\\E[etXi]  <  "+1 

i=  1 


1  2  n 


Section  6.4* 

A  TAIL  ESTIMATE 


11+  1. 
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Chapter  6  Putting  everything  together,  we  get  the  bound  we  want  to  prove: 

POINT  LOCATION 

Pr [Y  >  A  In (n  +  1)]  <  e~Xt^n+l \n  +  1)  =  /  "  +  \.  =  1  /(n  +  lW"1.  □ 

(, n+l)M 

We  use  this  lemma  to  prove  a  bound  on  the  expected  maximum  query  time. 

Lemma  6.7  Let  S  be  a  set  of  n  non-crossing  line  segments,  and  let  A  be  a 
parameter  with  A  >  0.  Then  the  probability  that  the  maximum  length  of  a 
search  path  in  the  structure  for  S  computed  by  Algorithm  TrapezoidalMap 
is  more  than  3A  ln(n  +  1)  is  at  most  2 /{n  +  i ln  1  -25—3 

Proof.  We  will  call  two  query  points  q  and  q'  equivalent  if  they  follow  the  same 
path  through  the  search  structure  CD.  Partition  the  plane  into  vertical  slabs  by 
passing  a  vertical  line  through  every  endpoint  of  S.  Partition  every  slab  into 
trapezoids  by  intersecting  it  with  all  possible  segments  in  S.  This  defines  a 
decomposition  of  the  plane  into  at  most  2(n  +  l)2  trapezoids.  Any  two  points 
lying  in  the  same  trapezoid  of  this  decomposition  must  be  equivalent  in  every 
possible  search  structure  for  S.  After  all,  the  only  comparisons  made  during  a 
search  are  the  test  whether  the  query  point  lies  to  the  left  or  to  the  right  of  the 
vertical  line  through  a  segment  endpoint,  and  the  test  whether  the  query  point 
lies  above  or  below  a  segment. 

This  implies  that  to  bound  the  maximum  length  of  a  search  path  in  CD,  it 
suffices  to  consider  the  search  paths  of  at  most  2{n  +  l)2  query  points,  one  in 
each  of  these  trapezoids.  By  Lemma  6.6,  the  probability  that  the  length  of  the 
search  path  for  a  fixed  point  q  exceeds  3A  ln(n+  1)  is  at  most  1  / (n  +  l)^lnl-25_1. 
In  the  worst  case,  the  probability  that  the  length  of  the  search  path  for  any  one  of 

the  2{n  +  l)2  test  points  exceeds  the  bound  is  therefore  at  most  2 (n  +  l)2/ (n  + 
j^Alnl.25-1  g 

This  lemma  implies  that  the  expected  maximum  query  time  is  O(k)gn). 
Take  for  instance  A  =  20.  Then  the  probability  that  the  maximum  length  of  a 
search  path  in  CD  is  more  than  3A  ln(n  +  1)  is  at  most  2/(n  +  l)14,  which  is  less 
than  1/4  for  n  >  4.  In  other  words,  the  probability  that  CD  has  a  good  query  time 
is  at  least  3 /4.  Similarly  one  can  show  that  the  probability  that  the  size  of  CD  is 
O(n)  is  at  least  3/4.  We  already  noted  that  if  the  maximum  length  of  a  search 
path  in  the  structure  is  (9(logn),  and  the  size  is  O(n),  then  the  running  time  of 
the  algorithm  is  O(nlogn).  Hence,  the  probability  that  the  query  time,  the  size, 
and  the  construction  time  are  good  is  at  least  1/2. 

Now  we  can  also  construct  a  search  structure  that  has  <9(logn)  worst-case 
query  time  and  uses  0(n)  space  in  the  worst  case.  What  we  do  is  the  following. 
We  run  Algorithm  TrapezoidalMap  on  the  set  S,  keeping  track  of  the  size  and 
the  maximum  length  of  a  query  path  of  the  search  structure  being  created.  As 
soon  as  the  size  exceeds  c\n,  or  the  depth  exceeds  C2logn,  for  suitably  chosen 
constants  c \  and  C2,  we  stop  the  algorithm,  and  start  it  again  from  the  beginning, 
with  a  fresh  random  permutation.  Since  the  probability  that  a  permutation  leads 
to  a  data  structure  with  the  desired  size  and  depth  is  at  least  1/4,  we  expect  to 
be  finished  in  four  trials.  (In  fact,  for  large  n  the  probability  is  almost  one,  so 
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NOTES  AND  COMMENTS 


The  constants  in  this  example  are  not  very  convincing — a  query  that  visits 
60 Inn  nodes  isn’t  really  so  attractive.  However,  much  better  constants  can  be 
proven  with  the  same  technique. 

The  theorem  above  does  not  say  anything  about  preprocessing  time.  Indeed, 
to  keep  track  of  the  maximum  length  of  a  query  path,  we  need  to  consider 
2 (n  +  l)2  test  points — see  Lemma  6.7.  It  is  possible  to  reduce  this  to  only 
0{n\ogn)  test  points,  which  makes  the  expected  preprocessing  time  0(nlog2n). 


6.5  Notes  and  Comments 

The  point  location  problem  has  a  long  history  in  computational  geometry.  Early 
results  are  surveyed  by  Preparata  and  Shamos  [323];  Snoeyink  [361]  gives 
a  more  recent  survey.  Of  all  the  methods  suggested  for  the  problem,  four 
basically  different  approaches  lead  to  optimal  <9(log  n)  search  time,  0(n)  storage 
solutions.  These  are  the  chain  method  by  Edelsbrunner  et  al.  [161],  which  is 
based  on  segment  trees  and  fractional  cascading  (see  also  Chapter  10),  the 
triangulation  refinement  method  by  Kirkpatrick  [236],  the  use  of  persistency 
by  Sarnak  and  Tarjan  [336]  and  Cole  [135],  and  the  randomized  incremental 
method  by  Mulmuley  [289].  Our  presentation  here  follows  Seidel’s  [345] 
presentation  and  analysis  of  Mulmuley’s  algorithm. 

Quite  a  lot  of  recent  research  has  gone  into  dynamic  point  location,  where 
the  subdivision  can  be  modified  by  adding  and  deleting  edges  [41,  1 15,  120,  19]. 
A  (now  somewhat  outdated)  survey  on  dynamic  point  location  is  given  by 
Chiang  and  Tamassia  [121], 

In  more  than  two  dimensions,  the  point  location  problem  is  still  not  fully 
resolved.  A  general  structure  for  convex  subdivisions  in  three  dimensions  is 
given  by  Preparata  and  Tamassia  [324].  One  can  also  use  dynamic  planar 
point-location  structures  together  with  persistency  techniques  to  obtain  a  static 
three-dimensional  point-location  structure  using  O(n\ogn)  storage  and  with 
0(log2«)  query  time  [361].  No  structure  with  linear  storage  and  (9(logn)  query 
time  is  known.  In  higher  dimensions,  efficient  point-location  structures  are 
only  known  for  special  subdivisions,  such  as  arrangements  of  hyperplanes  [95, 
104,  131].  Considering  the  subdivision  induced  by  a  set  H  of  n  hyperplanes  in 
cZ-dimensional  space,  it  is  well  known  that  the  combinatorial  complexity  of  this 
subdivision  (the  number  of  vertices,  edges,  and  so  on)  is  &(nd)  in  the  worst  case 
[158] — see  also  the  notes  and  comments  of  Chapter  8.  Chazelle  and  Friedman 
[104]  have  shown  that  such  subdivisions  can  be  stored  using  0(nd)  space,  such 
that  point  location  queries  take  O(\ogn)  time.  Other  special  subdivisions  that 
allow  for  efficient  point  location  are  convex  polytopes  [131,  266],  arrangements 
of  triangles  [59],  and  arrangements  of  algebraic  varieties  [102]. 


the  expected  number  of  trials  is  only  slightly  larger  than  one.)  This  leads  to  the 
following  result. 

Theorem  6.8  Let  8  be  a  planar  subdivision  with  n  edges.  There  exists  a  point 
location  data  structure  forS  that  uses  0(n)  storage  and  has  0(logn )  query  time 
in  the  worst  case. 
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Chapter  6  Other  point  location  problems  in  3-  and  higher-dimensional  space  that  can 

point  location  be  solved  efficiently  include  those  where  assumptions  on  the  shape  of  the  cells 
are  made.  Two  examples  are  rectangular  subdivisions  [57,  162],  and  so-called 
fat  subdivisions  [51,  302,  309], 

Usually,  a  point  location  query  aks  for  the  label  of  the  cell  of  a  subdivision 
that  contains  a  given  query  point.  For  point  location  in  a  convex  polytope 
in  cZ-dimensional  space,  that  means  that  there  are  only  two  possible  answers: 
inside  the  polytope,  or  on  the  outside.  Therefore,  one  could  hope  for  point 
location  structures  that  require  considerably  less  storage  than  the  combinatorial 
complexity  of  the  subdivision.  This  can  be  as  much  as  0(nLrf/2J)  for  convex 
polytopes  defined  by  the  intersection  of  n  half-spaces  [158] — see  also  the  notes 
and  comments  of  Chapter  11.  Indeed,  an  0(n)  size  data  structure  exists  in 
which  queries  take  0(n1-1/ Lrf/2J  log0^  n)  time  [264].  For  arrangements  of  line 
segments  in  the  plane,  other  implicit  point  location  structures  have  been  given 
that  require  less  than  quadratic  space,  even  if  the  arrangement  of  line  segments 
has  quadratic  complexity  [7,  160]. 


6.6  Exercises 

6.1  Draw  the  graph  of  the  search  structure  T>  for  the  set  of  segments  depicted 
in  the  margin,  for  some  insertion  order  of  the  segments. 

6.2  Give  an  example  of  a  set  of  n  line  segments  with  an  order  on  them  that 
makes  the  algorithm  create  a  search  structure  of  size  ®(n2)  and  worst-case 
query  time  ©(h). 

6.3  In  this  chapter  we  have  looked  at  the  point  location  problem  with  pre¬ 
processing.  We  have  not  looked  at  the  single  shot  problem,  where  the 
subdivision  and  the  query  point  are  given  at  the  same  time,  and  we  have 
no  special  preprocessing  to  speed  up  the  searches.  In  this  exercise  and 
some  of  the  following  ones,  we  have  a  look  at  such  problems. 

Given  a  simple  polygon  CP  with  n  vertices  and  a  query  point  q ,  here  is 
an  algorithm  to  determine  whether  q  lies  in  CP.  Consider  the  ray  p  := 
{ (qx  +  X ,  qy)  :  A  >  0}  (this  is  the  horizontal  ray  starting  in  q  and  going 
rightwards).  Determine  for  every  edge  e  of  CP  whether  it  intersects  p.  If 
the  number  of  intersecting  edges  is  odd,  then  q  £  CP,  otherwise  q  f  CP. 

Prove  that  this  algorithm  is  correct,  and  explain  how  to  deal  with  degen¬ 
erate  cases.  (One  degenerate  case  is  when  p  intersects  an  endpoint  of 
an  edge.  Are  there  other  special  cases?)  What  is  the  running  time  of  the 
algorithm? 

6.4  Show  that,  given  a  planar  subdivision  S  with  n  vertices  and  edges  and  a 
query  point  q ,  the  face  of  S  containing  q  can  be  computed  in  time  O(n). 
Assume  that  S  is  given  in  a  doubly-connected  edge  list. 
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6.5  Given  a  convex  polygon  CP  as  an  array  of  its  n  vertices  in  sorted  order  Section  6.6 

along  the  boundary.  Show  that,  given  a  query  point  q ,  it  can  be  tested  in  exercises 

time  O(logn)  whether  q  lies  inside  CP. 

6.6  Given  a  y-monotone  polygon  CP  as  an  array  of  its  n  vertices  in  sorted 
order  along  the  boundary.  Can  you  generalize  the  solution  to  the  previous 
exercise  to  y-monotone  polygons? 

6.7  A  polygon  CP  is  called  star-shaped  if  a  point  p  in  the  interior  of  CP  exists 
such  that,  for  any  other  point  q  in  CP,  the  line  segment  ~pq  lies  in  CP.  Assume 
that  such  a  point  p  is  given  with  the  star-shaped  polygon  CP.  As  in  the 
previous  two  exercises  the  vertices  of  CP  are  given  in  sorted  order  along 
the  boundary  in  an  array.  Show  that,  given  a  query  point  q,  it  can  be  tested 
in  time  (9(log«)  whether  q  lies  inside  CP.  What  if  CP  is  star-shaped,  but  the 
point  p  is  not  given? 

6.8  Design  a  deterministic  algorithm,  that  is,  one  that  doesn’t  make  random 
choices,  to  compute  the  trapezoidal  map  of  a  set  of  non-crossing  line 
segments.  Use  the  plane  sweep  paradigm  from  Chapter  2.  The  worst-case 
running  time  of  the  algorithm  should  be  O(n\ogn). 

6.9*  Give  a  randomized  algorithm  to  compute  in  0(n  log n  +  A)  expected  time 
all  pairs  of  intersecting  segments  in  a  set  of  n  line  segments,  where  A  is 
the  number  of  intersecting  pairs. 

6.10  Design  an  algorithm  with  running  time  O(nlogn)  for  the  following  prob¬ 
lem:  Given  a  set  P  of  n  points,  determine  a  value  of  e  >  0  such  that  the 
shear  transformation  <p  :  (x.y)  i— >  (x  +  £y,  y)  does  not  change  the  order  (in 
x-direction)  of  points  with  unequal  x-coordinates. 

6.1 1  Let  S  be  a  set  of  non-crossing  segments  in  the  plane,  and  let  s  be  a  new 
segment  not  crossing  any  of  the  segments  in  S.  Prove  that  a  trapezoid  A 
of  7(S)  is  also  a  trapezoid  of  T(5U  {.?})  if  and  only  if  s  does  not  intersect 
the  interior  of  A. 

6.12  Prove  that  the  number  of  inner  nodes  of  the  search  structure  CD  of  algo¬ 
rithm  TrapezoidalMap  increases  by  k,  —  1  in  iteration  i,  where  k,  is 
the  number  of  new  trapezoids  in  7 (Si)  (and  hence  the  number  of  new 
leaves  of  CD). 

6.13  Use  a  plane  sweep  argument  to  prove  that  the  trapezoidal  map  of  n  line 
segments  in  general  position  has  at  most  3 n  +  1  trapezoids.  (Imagine 
a  vertical  line  sweeping  over  the  plane  from  left  to  right,  stopping  at 
all  endpoints  of  segments.  Count  the  number  of  trapezoids  that  are 
encountered  by  the  sweep  line.) 

6.14  We  have  defined  the  trapezoidal  map  of  a  set  S  of  n  line  segments  only  for 
the  case  that  S  is  in  general  position.  Give  a  definition  for  the  trapezoidal 
map  7(S)  of  an  arbitrary  set  of  segments.  Prove  that  the  upper  bound  of 
3 n  +  1  for  the  number  of  trapezoids  still  holds. 
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6.15  Although  we  have  started  with  the  point  location  problem  on  the  surface 
of  the  earth,  we  have  only  treated  planar  point  location.  But  the  earth  is 
a  globe.  How  would  you  define  a  spherical  subdivision — a  subdivision 
of  the  surface  of  a  sphere?  Give  a  point  location  structure  for  such  a 
subdivision. 

6.16  The  ray  shooting  problem  occurs  in  computer  graphics  (see  Chapter  8). 
A  2-dimensional  version  can  be  given  as  follows:  Store  a  set  S  of  n 
non-crossing  line  segments  such  that  one  can  quickly  answer  queries 
of  the  type:  “Given  a  query  ray  p — a  ray  is  a  half-line  starting  at  some 
point — find  the  first  segment  in  S  intersected  by  p.”  (We  leave  it  to  you  to 
define  the  behavior  for  degenerate  cases.) 

In  this  exercise,  we  look  at  vertical  ray  shooting,  where  the  query  ray 
must  be  a  vertical  ray  pointing  upwards.  Only  the  starting  point  need  be 
specified  in  such  a  query. 

Give  a  data  structure  for  the  vertical  ray  shooting  problem  for  a  set  S  of 
n  non-crossing  line  segments  in  general  position.  Bound  the  query  time 
and  storage  requirement  of  your  data  structure.  What  is  the  preprocessing 
time? 

Can  you  do  the  same  when  the  segments  are  allowed  to  intersect  each 
other? 

6.17*  Prove  a  version  of  Theorem  6.8  that  gives  upper  bounds  for  the  number  of 
nodes  and  the  depth  of  the  search  structure  without  using  order  notation 
((9-bounds).  The  details  in  the  text  that  lead  to  Theorem  6.8  should  be 
modified  so  that  it  gives  better  constants. 
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The  Post  Office  Problem 


Suppose  you  are  on  the  advisory  board  for  the  planning  of  a  supermarket  chain, 
and  there  are  plans  to  open  a  new  branch  at  a  certain  location.  To  predict  whether 
the  new  branch  will  be  profitable,  you  must  estimate  the  number  of  customers  it 
will  attract.  For  this  you  have  to  model  the  behavior  of  your  potential  customers: 
how  do  people  decide  where  to  do  their  shopping?  A  similar  question  arises  in 
social  geography,  when  studying  the  economic  activities  in  a  country:  what  is 
the  trading  area  of  certain  cities?  In  a  more  abstract  setting  we  have  a  set  of 


Figure  7.1 

The  trading  areas  of  the  capitals  of  the 
twelve  provinces  in  the  Netherlands,  as 
predicted  by  the  Voronoi  assignment 
model 


central  places — called  sites — that  provide  certain  goods  or  services,  and  we  want 
to  know  for  each  site  where  the  people  live  who  obtain  their  goods  or  services 
from  that  site.  (In  computational  geometry  the  sites  are  traditionally  viewed 
as  post  offices  where  customers  want  to  post  their  letters — hence  the  subtitle 
of  this  chapter.)  To  study  this  question  we  make  the  following  simplifying 
assumptions: 

■  the  price  of  a  particular  good  or  service  is  the  same  at  every  site; 

■  the  cost  of  acquiring  the  good  or  service  is  equal  to  the  price  plus  the  cost 
of  transportation  to  the  site; 
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■  the  cost  of  transportation  to  a  site  equals  the  Euclidean  distance  to  the  site 

times  a  fixed  price  per  unit  distance; 

■  consumers  try  to  minimize  the  cost  of  acquiring  the  good  or  service. 
Usually  these  assumptions  are  not  completely  satisfied:  goods  may  be  cheaper 
at  some  sites  than  at  others,  and  the  transportation  cost  between  two  points  is 
probably  not  linear  in  the  Euclidean  distance  between  them.  But  the  model 
above  can  give  a  rough  approximation  of  the  trading  areas  of  the  sites.  Areas 
where  the  behavior  of  the  people  differs  from  that  predicted  by  the  model  can 
be  subjected  to  further  research,  to  see  what  caused  the  different  behavior. 

Our  interest  lies  in  the  geometric  interpretation  of  the  model  above.  The  assump¬ 
tions  in  the  model  induce  a  subdivision  of  the  total  area  under  consideration 
into  regions — the  trading  areas  of  the  sites — such  that  the  people  who  live  in  the 
same  region  all  go  to  the  same  site.  Our  assumptions  imply  that  people  simply 
get  their  goods  at  the  nearest  site — a  fairly  realistic  situation.  This  means  that 
the  trading  area  for  a  given  site  consists  of  all  those  points  for  which  that  site  is 
closer  than  any  other  site.  Figure  7.1  gives  an  example.  The  sites  in  this  figure 
are  the  capitals  of  the  twelve  provinces  in  the  Netherlands. 

The  model  where  every  point  is  assigned  to  the  nearest  site  is  called  the 
Voronoi  assignment  model.  The  subdivision  induced  by  this  model  is  called  the 
Voronoi  diagram  of  the  set  of  sites.  From  the  Voronoi  diagram  we  can  derive 
all  kinds  of  information  about  the  trading  areas  of  the  sites  and  their  relations. 
For  example,  if  the  regions  of  two  sites  have  a  common  boundary  then  these 
two  sites  are  likely  to  be  in  direct  competition  for  customers  that  live  in  the 
boundary  region. 

The  Voronoi  diagram  is  a  versatile  geometric  structure.  We  have  described 
an  application  to  social  geography,  but  the  Voronoi  diagram  has  applications 
in  physics,  astronomy,  robotics,  and  many  more  fields.  It  is  also  closely  linked 
to  another  important  geometric  structure,  the  so-called  Delaunay  triangulation, 
which  we  shall  encounter  in  Chapter  9.  In  the  current  chapter  we  shall  confine 
ourselves  to  the  basic  properties  and  the  construction  of  the  Voronoi  diagram  of 
a  set  of  point  sites  in  the  plane. 


7.1  Definition  and  Basic  Properties 

Denote  the  Euclidean  distance  between  two  points  p  and  q  by  dist(p,g).  In  the 
plane  we  have 

dist (p,q)  :=  ^ (px  -  qx)2  +  (py  -  qy)2. 

Let  P  :=  {pi,P2,  ■  ■  ■  ,Pn }  be  a  set  of  n  distinct  points  in  the  plane;  these  points 
are  the  sites.  We  define  the  Voronoi  diagram  of  P  as  the  subdivision  of  the  plane 
into  n  cells,  one  for  each  site  in  P.  with  the  property  that  a  point  q  lies  in  the 
cell  corresponding  to  a  site  p,  if  and  only  if  dist ('(/./;,)  <  di stfry,  pt)  for  each 
pj  £  P  with  j  ^  i.  We  denote  the  Voronoi  diagram  of  P  by  Vor(P).  Abusing  the 
terminology  slightly,  we  will  sometimes  use  ‘Vor(P)’  or  ‘Voronoi  diagram’  to 
indicate  only  the  edges  and  vertices  of  the  subdivision.  For  example,  when  we 
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say  that  a  Voronoi  diagram  is  connected  we  mean  that  the  union  of  its  edges  and 
vertices  forms  a  connected  set.  The  cell  of  Vor(P)  that  corresponds  to  a  site  p, 
is  denoted  V(pi');  we  call  it  the  Voronoi  cell  of  (In  the  terminology  of  the 
introduction  to  this  chapter:  V (/?,-)  is  the  trading  area  of  site  /?,.) 

We  now  take  a  closer  look  at  the  Voronoi  diagram.  First  we  study  the 
structure  of  a  single  Voronoi  cell.  For  two  points  p  and  q  in  the  plane  we  define 
the  bisector  of  p  and  q  as  the  perpendicular  bisector  of  the  line  segment  Jfq.  This 
bisector  splits  the  plane  into  two  half-planes.  We  denote  the  open  half-plane 
that  contains  p  by  h(p.q)  and  the  open  half-plane  that  contains  q  by  h(q.p). 
Notice  that  r  £  h(p,q)  if  and  only  if  dist(r,p)  <  dist(r,g).  From  this  we  obtain 
the  following  observation. 

Observation  7.1  V(pt)  =  fl i^n,tfih{Pi,Pj)- 

Thus  V(pi')  is  the  intersection  of  n  —  1  half-planes  and,  hence,  a  (possibly 
unbounded)  open  convex  polygonal  region  bounded  by  at  most  n—  1  vertices 
and  at  most  n—  1  edges. 

What  does  the  complete  Voronoi  diagram  look  like?  We  just  saw  that  each 
cell  of  the  diagram  is  the  intersection  of  a  number  of  half-planes,  so  the  Voronoi 
diagram  is  a  planar  subdivision  whose  edges  are  straight.  Some  edges  are  line 
segments  and  others  are  half-lines.  Unless  all  sites  are  collinear  there  will  be  no 
edges  that  are  full  lines: 

Theorem  7.2  Let  P  be  a  set  of  n  point  sites  in  the  plane.  If  all  the  sites  are 
collinear  then  Vor (P)  consists  of  n  1  parallel  lines.  Otherwise,  Vor (P)  is 
connected  and  its  edges  are  either  segments  or  half-lines. 

Proof.  The  first  part  of  the  theorem  is  easy  to  prove,  so  assume  that  not  all  sites 
in  P  are  collinear. 

We  first  show  that  the  edges  of  Vor(P)  are  either  segments  or  half-lines.  We 
already  know  that  the  edges  of  Vor(P)  are  parts  of  straight  lines,  namely  parts  of 
the  bisectors  between  pairs  of  sites.  Now  suppose  for  a  contradiction  that  there 
is  an  edge  e  of  Vor(L)  that  is  a  full  line.  Let  e  be  on  the  boundary  of  the  Voronoi 
cells  V(pi)  and  V(pj).  Let  p^  £  P  be  a  point  that  is  not  collinear  with  p,  and  pj. 
The  bisector  of  pj  and  pk  is  not  parallel  to  e  and,  hence,  it  intersects  e.  But  then 
the  part  of  e  that  lies  in  the  interior  of  h(pi,,p  j)  cannot  be  on  the  boundary  of 
V(pj),  because  it  is  closer  to  pi  than  to  pj,  a  contradiction. 

It  remains  to  prove  that  Vor(P)  is  connected.  If  this  were  not  the  case 
then  there  would  be  a  Voronoi  cell  V(pi)  splitting  the  plane  into  two.  Because 
Voronoi  cells  are  convex,  V(pi)  would  consist  of  a  strip  bounded  by  two  parallel 
full  lines.  But  we  just  proved  that  the  edges  of  the  Voronoi  diagram  cannot  be 
full  lines,  a  contradiction.  0 


Section  7.1 

DEFINITION  AND  BASIC  PROPERTIES 


Now  that  we  understand  the  structure  of  the  Voronoi  diagram  we  investigate 
its  complexity,  that  is,  the  total  number  of  its  vertices  and  edges.  Since  there  are 
n  sites  and  each  Voronoi  cell  has  at  most  n  —  1  vertices  and  edges,  the  complexity 
of  Vor(P)  is  at  most  quadratic.  It  is  not  clear,  however,  whether  Vor(P)  can 

actually  have  quadratic  complexity:  it  is  easy  to  construct  an  example  where  _ 

a  single  Voronoi  cell  has  linear  complexity,  but  can  it  happen  that  many  cells  149 
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have  linear  complexity?  The  following  theorem  shows  that  this  is  not  the  case 
and  that  the  average  number  of  vertices  of  the  Voronoi  cells  is  less  than  six. 

Theorem  7.3  Form  f  3,  the  number  of  vertices  in  the  Voronoi  diagram  of  a  set 
of  n  point  sites  in  the  plane  is  at  most  2  n  —  5  and  the  number  of  edges  is  at  most 
3  n  —  6. 

Proof.  If  the  sites  are  all  collinear  then  the  theorem  immediately  follows  from 
Theorem  7.2,  so  assume  this  is  not  the  case.  We  prove  the  theorem  using  Euler’s 
formula,  which  states  that  for  any  connected  planar  embedded  graph  with  mv 
nodes,  me  arcs,  and  nif  faces  the  following  relation  holds: 

mv  —  me  +  nif  =  2. 

We  cannot  apply  Euler’s  formula  directly  to  Vor(P),  because  Vor(P)  has  half¬ 
infinite  edges  and  is  therefore  not  a  proper  graph.  To  remedy  the  situation  we 
add  one  extra  vertex  v,*,  “at  infinity”  to  the  set  of  vertices  and  we  connect  all 
half-infinite  edges  of  Vor(P)  to  this  vertex.  We  now  have  a  connected  planar 
graph  to  which  we  can  apply  Euler’s  formula.  We  obtain  the  following  relation 
between  nv,  the  number  of  vertices  of  Vor(P),  ne,  the  number  of  edges  of  Vor(P), 
and  n,  the  number  of  sites: 


(nv+ 1)  —  ne  +n  =  2.  (7.1) 

Moreover,  every  edge  in  the  augmented  graph  has  exactly  two  vertices,  so  if 
we  sum  the  degrees  of  all  vertices  we  get  twice  the  number  of  edges.  Because 
every  vertex,  including  v„,  has  degree  at  least  three  we  get 

2ne  ^  3(nv+  1).  (7.2) 

Together  with  equation  (7.1)  this  implies  the  theorem.  0 

We  close  this  section  with  a  characterization  of  the  edges  and  vertices  of  the 
Voronoi  diagram.  We  know  that  the  edges  are  parts  of  bisectors  of  pairs  of  sites 
and  that  the  vertices  are  intersection  points  between  these  bisectors.  There  is 
a  quadratic  number  of  bisectors,  whereas  the  complexity  of  the  Vor(P)  is  only 
linear.  Hence,  not  all  bisectors  define  edges  of  Vor(P)  and  not  all  intersections 
are  vertices  of  Vor(P).  To  characterize  which  bisectors  and  intersections  define 
features  of  the  Voronoi  diagram  we  make  the  following  definition.  For  a  point 
q  we  define  the  largest  empty  circle  of  q  with  respect  to  P,  denoted  by  Cp(q), 
as  the  largest  circle  with  q  as  its  center  that  does  not  contain  any  site  of  P  in 
its  interior.  The  following  theorem  characterizes  the  vertices  and  edges  of  the 
Voronoi  diagram. 

Theorem  7.4  For  the  Voronoi  diagram  Vor  (P)  of  a  set  of  points  P  the  following 
holds: 

(i)  A  point  q  is  a  vertex  ofVor(P)  if  and  only  if  its  largest  empty  circle  Cp(q) 
contains  three  or  more  sites  on  its  boundary. 

(ii)  The  bisector  between  sites  p,  and  pj  defines  an  edge  ofVor(P)  if  and  only 
if  there  is  a  point  q  on  the  bisector  such  that  Cp(q )  contains  both  pi  and  pj 
on  its  boundary  but  no  other  site. 


150 


Proof,  (i)  Suppose  there  is  a  point  q  such  that  Cp  (q)  contains  three  or  more  sites 
on  its  boundary.  Let  /?,,  pj,  and  p^  be  three  of  those  sites.  Since  the  interior 
of  Cp(q)  is  empty  q  must  be  on  the  boundary  of  each  of  V (/>,-),  V(pj),  and 
V(pk),  and  q  must  be  a  vertex  of  Vor(P). 

On  the  other  hand,  every  vertex  q  of  Vor(P)  is  incident  to  at  least  three 
edges  and.  hence,  to  at  least  three  Voronoi  cells  V(pi),  V(pj),  and  V(pk). 
Vertex  q  must  be  equidistant  to  pj,  pj,  and  pk  and  there  cannot  be  another 
site  closer  to  q ,  since  otherwise  V(pi),  V(pj),  and  V(pk)  would  not  meet  at  q. 
Hence,  the  interior  of  the  circle  with  ph  pj ,  and  pk  on  its  boundary  does  not 
contain  any  site. 

(ii)  Suppose  there  is  a  point  q  with  the  property  stated  in  the  theorem.  Since 
Cp(q)  does  not  contain  any  sites  in  its  interior  and  p,  and  pj  are  on  its 
boundary,  we  have  dist(^,^>,)  =  dist(g,/?;)  ^  dist(^,^)  for  all  1  ^  k  ^  n. 
It  follows  that  q  lies  on  an  edge  or  vertex  of  Vor(P).  The  first  part  of  the 
theorem  implies  that  q  cannot  be  a  vertex  of  Vor(P).  Hence,  q  lies  on  an  edge 
of  Vor(P),  which  is  defined  by  the  bisector  of  p,  and  pj. 

Conversely,  let  the  bisector  of  /?,  and  pj  define  a  Voronoi  edge.  The 
largest  empty  circle  of  any  point  q  in  the  interior  of  this  edge  must  contain  pj 
and  pj  on  its  boundary  and  no  other  sites.  0 
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7.2  Computing  the  Voronoi  Diagram 

In  the  previous  section  we  studied  the  structure  of  the  Voronoi  diagram.  We 
now  set  out  to  compute  it.  Observation  7.1  suggests  a  simple  way  to  do  this: 
for  each  site  /?,,  compute  the  common  intersection  of  the  half-planes  h(pj.pj), 
with  j  /  i,  using  the  algorithm  presented  in  Chapter  4.  This  way  we  spend 
0(n  logn)  time  per  Voronoi  cell,  leading  to  an  0{n 2  logn)  algorithm  to  compute 
the  whole  Voronoi  diagram.  Can’t  we  do  better?  After  all,  the  total  complexity 
of  the  Voronoi  diagram  is  only  linear.  The  answer  is  yes:  the  plane  sweep 
algorithm  described  below — commonly  known  as  Fortune ’s  algorithm  after 
its  inventor — computes  the  Voronoi  diagram  in  0{n\ogn)  time.  You  may  be 
tempted  to  look  for  an  even  faster  algorithm,  for  example  one  that  runs  in  linear 
time.  This  turns  out  to  be  too  much  to  ask:  the  problem  of  sorting  n  real  numbers 
is  reducible  to  the  problem  of  computing  Voronoi  diagrams,  so  any  algorithm 
for  computing  Voronoi  diagrams  must  take  Cl(n\ogn)  time  in  the  worst  case. 

Hence,  Fortune’s  algorithm  is  optimal. 

The  strategy  in  a  plane  sweep  algorithm  is  to  sweep  a  horizontal  line — the 
sweep  line — from  top  to  bottom  over  the  plane.  While  the  sweep  is  performed 
information  is  maintained  regarding  the  structure  that  one  wants  to  compute. 

More  precisely,  information  is  maintained  about  the  intersection  of  the  structure 
with  the  sweep  line.  While  the  sweep  line  moves  downwards  the  information 
does  not  change,  except  at  certain  special  points — the  event  points. 

Let’s  try  to  apply  this  general  strategy  to  the  computation  of  the  Voronoi  diagram  _ 

of  a  set  P  =  {pi,P2,  ■  ■  ■  i Pn}  °f  point  sites  in  the  plane.  According  to  the  plane  151 
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sweep  paradigm  we  move  a  horizontal  sweep  line  £  from  top  to  bottom  over 
the  plane.  The  paradigm  involves  maintaining  the  intersection  of  the  Voronoi 
diagram  with  the  sweep  line.  Unfortunately  this  is  not  so  easy,  because  the  part 
of  Vor(P)  above  £  depends  not  only  on  the  sites  that  lie  above  £  but  also  on  sites 
below  £.  Stated  differently,  when  the  sweep  line  reaches  the  topmost  vertex 
of  the  Voronoi  cell  V(pi)  it  has  not  yet  encountered  the  corresponding  site  /?,. 
Hence,  we  do  not  have  all  the  information  needed  to  compute  the  vertex.  We  are 
forced  to  apply  the  plane  sweep  paradigm  in  a  slightly  different  fashion:  instead 
of  maintaining  the  intersection  of  the  Voronoi  diagram  with  the  sweep  line,  we 
maintain  information  about  the  part  of  the  Voronoi  diagram  of  the  sites  above  £ 
that  cannot  be  changed  by  sites  below  £. 

Denote  the  closed  half-plane  above  £  by  £+.  What  is  the  part  of  the  Voronoi 
diagram  above  £  that  cannot  be  changed  anymore?  In  other  words,  for  which 
points  q  £  £+  do  we  know  for  sure  what  their  nearest  site  is?  The  distance  of 
a  point  q  £  £+  to  any  site  below  £  is  greater  than  the  distance  of  q  to  £  itself. 
Hence,  the  nearest  site  of  q  cannot  lie  below  £  if  q  is  at  least  as  near  to  some  site 
Pi  £  £+  as  q  is  to  £.  The  locus  of  points  that  are  closer  to  some  site  pi  £  £+  than 
to  £  is  bounded  by  a  parabola.  Hence,  the  locus  of  points  that  are  closer  to  any 
site  above  £  than  to  £  itself  is  bounded  by  parabolic  arcs.  We  call  this  sequence 
of  parabolic  arcs  the  beach  line.  Another  way  to  visualize  the  beach  line  is  the 
following.  Every  site  p,  above  the  sweep  line  defines  a  complete  parabola  j3,. 
The  beach  line  is  the  function  that — for  each  x-coordinate — passes  through  the 
lowest  point  of  all  parabolas. 

Observation  7.5  The  beach  line  is  x-monotone,  that  is,  every  vertical  line 
intersects  it  in  exactly  one  point. 


It  is  easy  to  see  that  one  parabola  can  contribute  more  than  once  to  the  beach 
line.  Well  worry  later  about  how  many  pieces  there  can  be.  Notice  that  the 
breakpoints  between  the  different  parabolic  arcs  forming  the  beach  line  lie  on 
edges  of  the  Voronoi  diagram.  This  is  not  a  coincidence:  the  breakpoints  exactly 
trace  out  the  Voronoi  diagram  while  the  sweep  line  moves  from  top  to  bottom. 
These  properties  of  the  beach  line  can  be  proved  using  elementary  geometric 
arguments. 

So,  instead  of  maintaining  the  intersection  of  Vor(P)  with  l  we  maintain 
the  beach  line  as  we  move  our  sweep  line  l.  We  do  not  maintain  the  beach  line 
explicitly,  since  it  changes  continuously  as  £  moves.  For  the  moment  let’s  ignore 
the  issue  of  how  to  represent  the  beach  line  until  we  understand  where  and  how 
its  combinatorial  structure  changes.  This  happens  when  a  new  parabolic  arc 
appears  on  it,  and  when  a  parabolic  arc  shrinks  to  a  point  and  disappears. 


First  we  consider  the  events  where  a  new  arc  appears  on  the  beach  line.  One 
occasion  where  this  happens  is  when  the  sweep  line  £  reaches  a  new  site.  The 
parabola  defined  by  this  site  is  at  first  a  degenerate  parabola  with  zero  width:  a 
vertical  line  segment  connecting  the  new  site  to  the  beach  line.  As  the  sweep 

_  line  continues  to  move  downward  the  new  parabola  gets  wider  and  wider.  The 

152  part  of  the  new  parabola  below  the  old  beach  line  is  now  a  part  of  the  new  beach 


line.  Figure  7.2  illustrates  this  process.  We  call  the  event  where  a  new  site  is  Section  7.2 

encountered  a  site  event.  computing  the  voronoi  diagram 


Figure  7.2 

A  new  arc  appears  on  the  beach  line 
because  a  site  is  encountered 


What  happens  to  the  Voronoi  diagram  at  a  site  event?  Recall  that  the 
breakpoints  on  the  beach  line  trace  out  the  edges  of  the  Voronoi  diagram.  At  a 
site  event  two  new  breakpoints  appear,  which  start  tracing  out  edges.  In  fact, 
the  new  breakpoints  coincide  at  first,  and  then  move  in  opposite  directions  to 
trace  out  the  same  edge.  Initially,  this  edge  is  not  connected  to  the  rest  of  the 
Voronoi  diagram  above  the  sweep  line.  Later  on — we  will  see  shortly  exactly 
when  this  will  happen — the  growing  edge  will  run  into  another  edge,  and  it 
becomes  connected  to  the  rest  of  the  diagram. 

So  now  we  understand  what  happens  at  a  site  event:  a  new  arc  appears  on 
the  beach  line,  and  a  new  edge  of  the  Voronoi  diagram  starts  to  be  traced  out. 
Is  it  possible  that  a  new  arc  appears  on  the  beach  line  in  any  other  way?  The 
answer  is  no: 

Lemma  7.6  The  only  way  in  which  a  new  arc  can  appear  on  the  beach  line  is 
through  a  site  event. 

Proof.  Suppose  for  a  contradiction  that  an  already  existing  parabola  Pj  defined 
by  a  site  pj  breaks  through  the  beach  line.  There  are  two  ways  in  which  this 
could  happen. 

The  first  possibility  is  that  Pj  breaks  through  in  the  middle  of  an  arc  of  a 
parabola  J3,-.  The  moment  this  is  about  to  happen,  /3,  and  p  t  are  tangent,  that  is, 
they  have  exactly  one  point  of  intersection.  Let  £y  denote  the  y-coordinate  of  the 
sweep  line  at  the  moment  of  tangency.  If  pj  :=  {pj^iPj.y),  then  the  parabola  Pj 
is  given  by 

pj  :=  y=2(Pjl-ey)  +  + 

The  formula  for  Pi  is  similar,  of  course.  Using  that  both  /j/  v  and  /?,  ,  are  larger 
than  ly,  it  is  easy  to  show  that  it  is  impossible  that  pi  and  Pj  have  only  one  point 
of  intersection.  Hence,  a  parabola  j Bj  never  breaks  through  in  the  middle  of  an 
arc  of  another  parabola  Pi. 

The  second  possibility  is  that  /3;  appears  in  between  two  arcs.  Let  these 
arcs  be  part  of  parabolas  Pi  and  pk .  Let  q  be  the  intersection  point  of  Pi  and 
Pk  at  which  Pj  is  about  to  appear  on  the  beach  line,  and  assume  that  Pi  is  on 
the  beach  line  left  of  q  and  pk  is  on  the  beach  line  right  of  q ,  as  in  Figure  7.3. 
Then  there  is  a  circle  C  that  passes  through  /?,,  pj,  and  pk,  the  sites  defining 
the  parabolas.  This  circle  is  also  tangent  to  the  sweep  line  t.  The  cyclic  order 
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Chapter  7  on  C,  starting  at  the  point  of  tangency  with  £  and  going  clockwise,  is  Pi,Pj,Pk , 

voronoi  diagrams  because  fi  j  is  assumed  to  appear  in  between  the  arcs  of  /j,  and  /j/,.  Consider  an 

infinitesimal  motion  of  the  sweep  line  downward  while  keeping  the  circle  C 
tangent  to  i\  see  Figure  7.3.  Then  C  cannot  have  empty  interior  and  still  pass 


Figure  7.3 

The  situation  when  j3 j  would  appear  on 
the  beach  line,  and  the  circle  when  the 
sweep  line  has  proceeded 


i 


through  pf.  either  /?,  or  pk  will  penetrate  the  interior.  Therefore,  in  a  sufficiently 
small  neighborhood  of  q  the  parabola  /j,  cannot  appear  on  the  beach  line  when 
the  sweep  line  moves  downward,  because  either  /?,  or  pk  will  be  closer  to  t 
than  pj.  0 

An  immediate  consequence  of  the  lemma  is  that  the  beach  line  consists  of 
at  most  2 n  —  1  parabolic  arcs:  each  site  encountered  gives  rise  to  one  new  arc 
and  the  splitting  of  at  most  one  existing  arc  into  two,  and  there  is  no  other  way 
an  arc  can  appear  on  the  beach  line. 


The  second  type  of  event  in  the  plane  sweep  algorithm  is  where  an  existing  arc 
of  the  beach  line  shrinks  to  a  point  and  disappears,  as  in  Figure  7.4.  Let  a' 
be  the  disappearing  arc,  and  let  a  and  a"  be  the  two  neighboring  arcs  of  a' 
before  it  disappears.  The  arcs  a  and  a"  cannot  be  part  of  the  same  parabola; 
this  possibility  can  be  excluded  in  the  same  way  as  the  first  possibility  in  the 
proof  of  Lemma  7.6  was  excluded.  Hence,  the  three  arcs  a,  a ',  and  a"  are 
defined  by  three  distinct  sites  /?,,  pj.  and  pk .  At  the  moment  a'  disappears,  the 
parabolas  defined  by  these  three  sites  pass  through  a  common  point  q.  Point 

_  q  is  equidistant  from  £  and  each  of  the  three  sites.  Hence,  there  is  a  circle 

154  passing  through  pj,  and  pk  with  q  as  its  center  and  whose  lowest  point  lies 


on  i.  There  cannot  be  a  site  in  the  interior  of  this  circle:  such  a  site  would  be  Section  7.2 

closer  to  q  than  q  is  to  £,  contradicting  the  fact  that  q  is  on  the  beach  line.  It  computing  the  voronoi  diagram 

follows  that  the  point  q  is  a  vertex  of  the  Voronoi  diagram.  This  is  not  very 

surprising,  since  we  observed  earlier  that  the  breakpoints  on  the  beach  line  trace 

out  the  Voronoi  diagram.  So  when  an  arc  disappears  from  the  beach  line  and 

two  breakpoints  meet,  two  edges  of  the  Voronoi  diagram  meet  as  well.  We  call 

the  event  where  the  sweep  line  reaches  the  lowest  point  of  a  circle  through  three 

sites  defining  consecutive  arcs  on  the  beach  line  a  circle  event.  From  the  above 

we  can  conclude  the  following  lemma. 

Lemma  7.7  The  only  way  in  which  an  existing  arc  can  disappear  from  the  beach 
line  is  through  a  circle  event. 

Now  we  know  where  and  how  the  combinatorial  structure  of  the  beach  line 
changes:  at  a  site  event  a  new  arc  appears,  and  at  a  circle  event  an  existing 
arc  drops  out.  We  also  know  how  this  relates  to  the  Voronoi  diagram  under 
construction:  at  a  site  event  a  new  edge  starts  to  grow,  and  at  a  circle  event 
two  growing  edges  meet  to  form  a  vertex.  It  remains  to  find  the  right  data 
structures  to  maintain  the  necessary  information  during  the  sweep.  Our  goal 
is  to  compute  the  Voronoi  diagram,  so  we  need  a  data  structure  that  stores 
the  part  of  the  Voronoi  diagram  computed  thus  far.  We  also  need  the  two 
‘standard’  data  structures  for  any  sweep  line  algorithm:  an  event  queue  and  a 
structure  that  represents  the  status  of  the  sweep  line.  Here  the  latter  structure  is 
a  representation  of  the  beach  line.  These  data  structures  are  implemented  in  the 
following  way. 


■  We  store  the  Voronoi  diagram  under  construction  in  our  usual  data  struc¬ 
ture  for  subdivisions,  the  doubly-connected  edge  list.  A  Voronoi  diagram, 
however,  is  not  a  true  subdivision  as  defined  in  Chapter  2:  it  has  edges 
that  are  half-lines  or  full  lines,  and  these  cannot  be  represented  in  a  doubly- 
connected  edge  list.  During  the  construction  this  is  not  a  problem,  because 
the  representation  of  the  beach  line — described  next — will  make  it  possible 
to  access  the  relevant  parts  of  the  doubly-connected  edge  list  efficiently 
during  its  construction.  But  after  the  computation  is  finished  we  want  to 
have  a  valid  doubly-connected  edge  list.  To  this  end  we  add  a  big  bounding 
box  to  our  scene,  which  is  large  enough  so  that  it  contains  all  vertices  of 
the  Voronoi  diagram.  The  final  subdivision  we  compute  will  then  be  the 
bounding  box  plus  the  part  of  the  Voronoi  diagram  inside  it. 


■  The  beach  line  is  represented  by  a  balanced  binary  search  tree  7;  it  is  the 
status  structure.  Its  leaves  correspond  to  the  arcs  of  the  beach  line — which 
is  x-monotone — in  an  ordered  manner:  the  leftmost  leaf  represents  the 
leftmost  arc,  the  next  leaf  represents  the  second  leftmost  arc,  and  so  on. 

Each  leaf  ll  stores  the  site  that  defines  the  arc  it  represents.  The  internal 
nodes  of  T  represent  the  breakpoints  on  the  beach  line.  A  breakpoint  is 
stored  at  an  internal  node  by  an  ordered  tuple  of  sites  ( Pi,Pj ),  where  /?, 

defines  the  parabola  left  of  the  breakpoint  and  pj  defines  the  parabola  to  the  _ 

right.  Using  this  representation  of  the  beach  line,  we  can  find  in  O(logn)  155 


Chapter  7  time  the  arc  of  the  beach  line  lying  above  a  new  site.  At  an  internal  node, 

voronoi  diagrams  we  simply  compare  the  x-coordinate  of  the  new  site  with  the  x-coordinate 

of  the  breakpoint,  which  can  be  computed  from  the  tuple  of  sites  and  the 
position  of  the  sweep  line  in  constant  time.  Note  that  we  do  not  explicitly 
store  the  parabolas. 

In  T  we  also  store  pointers  to  the  other  two  data  structures  used  during 
the  sweep.  Each  leaf  of  T,  representing  an  arc  a,  stores  one  pointer  to  a 
node  in  the  event  queue,  namely,  the  node  that  represents  the  circle  event  in 
which  a  will  disappear.  This  pointer  is  nil  if  no  circle  event  exists  where  a 
will  disappear,  or  this  circle  event  hasn’t  been  detected  yet.  Finally,  every 
internal  node  v  has  a  pointer  to  a  half-edge  in  the  doubly-connected  edge 
list  of  the  Voronoi  diagram.  More  precisely,  v  has  a  pointer  to  one  of  the 
half-edges  of  the  edge  being  traced  out  by  the  breakpoint  represented  by  v. 
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■  The  event  queue  Q  is  implemented  as  a  priority  queue,  where  the  priority  of 
an  event  is  its  y- coordinate.  It  stores  the  upcoming  events  that  are  already 
known.  For  a  site  event  we  simply  store  the  site  itself.  For  a  circle  event  the 
event  point  that  we  store  is  the  lowest  point  of  the  circle,  with  a  pointer  to 
the  leaf  in  T  that  represents  the  arc  that  will  disappear  in  the  event. 

All  the  site  events  are  known  in  advance,  but  the  circle  events  are  not.  This 
brings  us  to  one  final  issue  that  we  must  discuss,  namely  the  detection  of  circle 
events. 

During  the  sweep  the  beach  line  changes  its  topological  structure  at  every 
event.  This  may  cause  new  triples  of  consecutive  arcs  to  appear  on  the  beach 
line  and  it  may  cause  existing  triples  to  disappear.  Our  algorithm  will  make  sure 
that  for  every  three  consecutive  arcs  on  the  beach  line  that  define  a  potential 
circle  event,  the  potential  event  is  stored  in  the  event  queue  Q.  There  are  two 
subtleties  involved  in  this.  First  of  all,  there  can  be  consecutive  triples  whose 
two  breakpoints  do  not  converge,  that  is,  the  directions  in  which  they  move  are 
such  that  they  will  not  meet  in  the  future;  this  happens  when  the  breakpoints 
move  along  two  bisectors  away  from  the  intersection  point.  In  this  case  the 
triple  does  not  define  a  potential  circle  event.  Secondly,  even  if  a  triple  has 
converging  breakpoints,  the  corresponding  circle  event  need  not  take  place:  it 
can  happen  that  the  triple  disappears  (for  instance  due  to  the  appearance  of  a 
new  site  on  the  beach  line)  before  the  event  has  taken  place.  In  this  case  we  call 
the  event  a  false  alarm. 

So  what  the  algorithm  does  is  this.  At  every  event,  it  checks  all  the  new 
triples  of  consecutive  arcs  that  appear.  For  instance,  at  a  site  event  we  can 
get  three  new  triples:  one  where  the  new  arc  is  the  left  arc  of  the  triple,  one 
where  it  is  the  middle  arc,  and  one  where  it  is  the  right  arc.  When  such  a  new 
triple  has  converging  breakpoints,  the  event  is  inserted  into  the  event  queue  Q. 
Observe  that  in  the  case  of  a  site  event,  the  triple  with  the  new  arc  being  the 
middle  one  can  never  cause  a  circle  event,  because  the  left  and  right  arc  of 
the  triple  come  from  the  same  parabola  and  therefore  the  breakpoints  must 
diverge.  Furthermore,  for  all  disappearing  triples  it  is  checked  whether  they 
have  a  corresponding  event  in  Q.  If  so,  the  event  is  apparently  a  false  alarm,  and 


it  is  deleted  from  Q.  This  can  easily  be  done  using  the  pointers  we  have  from  Section  7.2 

the  leaves  in  T  to  the  corresponding  circle  events  in  Q.  computing  the  voronoi  diagram 

Lemma  7.8  Every  Voronoi  vertex  is  detected  by  means  of  a  circle  event. 

Proof.  For  a  Voronoi  vertex  q ,  let  p,-,  p:,  and  p *  be  the  three  sites  through  which 
a  circle  C(pi,pj,pk)  passes  with  no  sites  in  the  interior.  By  Theorem  7.4,  such  a 
circle  and  three  sites  indeed  exist.  For  simplicity  we  only  prove  the  case  where 
no  other  sites  lie  on  C(p,1pj1pk ),  and  the  lowest  point  of  C(pi,pj,pk)  is  not 
one  of  the  defining  sites.  Assume  without  loss  of  generality  that  from  the  lowest 
point  of  C(pi1pj1pic),  the  clockwise  traversal  of  C(p,.p j.pf)  encounters  the 
sites  Pi,Pj,Pk  in  this  order. 

We  must  show  that  just  before  the  sweep  line  reaches  the  lowest  point  of 
C(pj.pj.pk),  there  are  three  consecutive  arcs  a,  a'  and  a "  on  the  beach  line 
defined  by  the  sites  pj,  pj ,  and  pk-  Only  then  will  the  circle  event  take  place. 

Consider  the  sweep  line  an  infinitesimal  amount  before  it  reaches  the  lowest 
point  of  C(pi,pj,pk).  Since  C(pj,pj,p/c)  doesn’t  contain  any  other  sites  inside 
or  on  it,  there  exists  a  circle  through  pt  and  pj  that  is  tangent  to  the  sweep  line, 
and  doesn’t  contain  sites  in  the  interior.  So  there  are  adjacent  arcs  on  the  beach 
line  defined  by  /?,  and  pj.  Similarly,  there  are  adjacent  arcs  on  the  beach  line 
defined  by  pj  and  pk-  It  is  easy  to  see  that  the  two  arcs  defined  by  pj  are  actually 
the  same  arc,  and  it  follows  that  there  are  three  consecutive  arcs  on  the  beach 
line  defined  by  pj,  and  pk-  Therefore,  the  corresponding  circle  event  is  in  Q 
just  before  the  event  takes  place,  and  the  Voronoi  vertex  is  detected.  0 

We  can  now  describe  the  plane  sweep  algorithm  in  detail.  Notice  that  after 
all  events  have  been  handled  and  the  event  queue  Q  is  empty,  the  beach  line 
hasn’t  disappeared  yet.  The  breakpoints  that  are  still  present  correspond  to  the 
half-infinite  edges  of  the  Voronoi  diagram.  As  stated  earlier,  a  doubly-connected 
edge  list  cannot  represent  half-infinite  edges,  so  we  must  add  a  bounding  box 
to  the  scene  to  which  these  edges  can  be  attached.  The  overall  structure  of  the 
algorithm  is  as  follows. 

Algorithm  VoronoiDiagram(P) 

Input.  A  set  P  :=  {p\, . . . ,  p„ }  of  point  sites  in  the  plane. 

Output.  The  Voronoi  diagram  Vor(P)  given  inside  a  bounding  box  in  a  doubly- 
connected  edge  list  T>. 

1 .  Initialize  the  event  queue  Q  with  all  site  events,  initialize  an  empty  status 
structure  T  and  an  empty  doubly-connected  edge  list  T>. 

2.  while  Q  is  not  empty 

3.  do  Remove  the  event  with  largest  y-coordinate  from  Q. 

4.  if  the  event  is  a  site  event,  occurring  at  site  pt 

5.  then  Handles  iteEvent(p,) 

6.  else  HandleCircleEvent(7),  where  y  is  the  leaf  of  T  repre¬ 

senting  the  arc  that  will  disappear 

7.  The  internal  nodes  still  present  in  T  correspond  to  the  half-infinite  edges  of 
the  Voronoi  diagram.  Compute  a  bounding  box  that  contains  all  vertices  of 
the  Voronoi  diagram  in  its  interior,  and  attach  the  half-infinite  edges  to  the 
bounding  box  by  updating  the  doubly-connected  edge  list  appropriately. 
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Chapter  7  8.  Traverse  the  half-edges  of  the  doubly-connected  edge  list  to  add  the  cell 

voronoi  diagrams  records  and  the  pointers  to  and  from  them. 

The  procedures  to  handle  the  events  are  defined  as  follows. 
HandleSiteEvent(p,) 

1 .  If  O'  is  empty,  insert  /?,  into  it  (so  that  7  consists  of  a  single  leaf  storing  pi) 
and  return.  Otherwise,  continue  with  steps  2-  5. 

2.  Search  in  O'  for  the  arc  a  vertically  above  If  the  leaf  representing  a  has 
a  pointer  to  a  circle  event  in  Q,  then  this  circle  event  is  a  false  alarm  and  it 
must  be  deleted  from  Q. 

3.  Replace  the  leaf  of  T  that  represents  a  with  a  subtree  having  three  leaves. 
The  middle  leaf  stores  the  new  site  pi  and  the  other  two  leaves  store  the  site 
pj  that  was  originally  stored  with  a.  Store  the  tuples  ( Pj,Pi )  and  ( Pi,Pj ) 
representing  the  new  breakpoints  at  the  two  new  internal  nodes.  Perform 
rebalancing  operations  on  T  if  necessary. 

4.  Create  new  half-edge  records  in  the  Voronoi  diagram  structure  for  the 
edge  separating  V(pf)  and  V(pj),  which  will  be  traced  out  by  the  two  new 
breakpoints. 

5.  Check  the  triple  of  consecutive  arcs  where  the  new  arc  for  pi  is  the  left  arc 
to  see  if  the  breakpoints  converge.  If  so,  insert  the  circle  event  into  Q  and 
add  pointers  between  the  node  in  7  and  the  node  in  Q.  Do  the  same  for  the 
triple  where  the  new  arc  is  the  right  arc. 

HandleCircleEvent(7) 

1.  Delete  the  leaf  y  that  represents  the  disappearing  arc  a  from  7.  Update 
the  tuples  representing  the  breakpoints  at  the  internal  nodes.  Perform 
rebalancing  operations  on  T  if  necessary.  Delete  all  circle  events  involving 
a  from  Q;  these  can  be  found  using  the  pointers  from  the  predecessor  and 
the  successor  of  y  in  ‘T.  (The  circle  event  where  a  is  the  middle  arc  is 
currently  being  handled,  and  has  already  been  deleted  from  Q.) 

2.  Add  the  center  of  the  circle  causing  the  event  as  a  vertex  record  to  the 
doubly-connected  edge  list  ©  storing  the  Voronoi  diagram  under  construc¬ 
tion.  Create  two  half-edge  records  corresponding  to  the  new  breakpoint 
of  the  beach  line.  Set  the  pointers  between  them  appropriately.  Attach  the 
three  new  records  to  the  half-edge  records  that  end  at  the  vertex. 

3.  Check  the  new  triple  of  consecutive  arcs  that  has  the  former  left  neighbor 
of  a  as  its  middle  arc  to  see  if  the  two  breakpoints  of  the  triple  converge. 
If  so,  insert  the  corresponding  circle  event  into  Q.  and  set  pointers  between 
the  new  circle  event  in  Q  and  the  corresponding  leaf  of  T.  Do  the  same  for 
the  triple  where  the  former  right  neighbor  is  the  middle  arc. 

Lemma  7.9  The  algorithm  runs  in  0(n  log/;)  time  and  it  uses  0(/i)  storage. 

Proof.  The  primitive  operations  on  the  tree  7  and  the  event  queue  Q,  such 

as  inserting  or  deleting  an  element,  take  O(logn)  time  each.  The  primitive 

operations  on  the  doubly-connected  edge  list  take  constant  time.  To  handle 

an  event  we  do  a  constant  number  of  such  primitive  operations,  so  we  spend 
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O(logH)  time  to  process  an  event.  Obviously,  there  are  n  site  events.  As  for  Section  7.2 

the  number  of  circle  events,  we  observe  that  every  such  event  that  is  processed  computing  the  voronoi  diagram 

defines  a  vertex  of  Vor(P).  Note  that  false  alarms  are  deleted  from  Q  before 

they  are  processed.  They  are  created  and  deleted  while  processing  another,  real 

event,  and  the  time  we  spend  on  them  is  subsumed  tinder  the  time  we  spend  to 

process  this  event.  Hence,  the  number  of  circle  events  that  we  process  is  at  most 

2n  —  5.  The  time  and  storage  bounds  follow.  Li 


Before  we  state  the  final  result  of  this  section  we  should  say  a  few  words  about 
degenerate  cases. 

The  algorithm  handles  the  events  from  top  to  bottom,  so  there  is  a  degeneracy 
when  two  or  more  events  lie  on  a  common  horizontal  line.  This  happens,  for 
example,  when  there  are  two  sites  with  the  same  y-coordinate.  These  events  can 
be  handled  in  any  order  when  their  x-coordi  nates  are  distinct,  so  we  can  break 
ties  between  events  with  the  same  y-coordinate  but  with  different  x-coordinates 
arbitrarily.  However,  if  this  happens  right  at  the  start  of  the  algorithm,  that  is, 
if  the  second  site  event  has  the  same  y-coordinate  as  the  first  site  event,  then 
special  code  is  needed  because  there  is  no  arc  above  the  second  site  yet.  Now 
suppose  there  are  event  points  that  coincide.  For  instance,  there  will  be  several 
coincident  circle  events  when  there  are  four  or  more  co-circular  sites,  such  that 
the  interior  of  the  circle  through  them  is  empty.  The  center  of  this  circle  is  a 
vertex  of  the  Voronoi  diagram.  The  degree  of  this  vertex  is  at  least  four.  We 
could  write  special  code  to  handle  such  degenerate  cases,  but  there  is  no  need  to 
do  so.  What  will  happen  if  we  let  the  algorithm  handle  these  events  in  arbitrary 
order?  Instead  of  producing  a  vertex  with  degree  four,  it  will  just  produce  two 
vertices  with  degree  three  at  the  same  location,  with  a  zero  length  edge  between 
them.  These  degenerate  edges  can  be  removed  in  a  post-processing  step,  if 
required. 

Besides  these  degeneracies  in  choosing  the  order  of  the  events  we  may  also 
encounter  degeneracies  while  handling  an  event.  This  occurs  when  a  site  p,-  that 
we  process  happens  to  be  located  exactly  below  the  breakpoint  between  two  arcs 
on  the  beach  line.  In  this  case  the  algorithm  splits  either  of  these  two  arcs  and 
inserts  the  arc  for  p,  in  between  the  two  pieces,  one  of  which  has  zero  length. 
This  piece  of  zero  length  now  is  the  middle  arc  of  a  triple  that  defines  a  circle 
event.  The  lowest  point  of  this  circle  coincides  with  /?,  .  The  algorithm  inserts 
this  circle  event  into  the  event  queue  Q,  because  there  are  three  consecutive  arcs 
on  the  beach  line  that  define  it.  When  this  circle  event  is  handled,  a  vertex  of 
the  Voronoi  diagram  is  correctly  created  and  the  zero  length  arc  can  be  deleted 
later.  Another  degeneracy  occurs  when  three  consecutive  arcs  on  the  beach  line 
are  defined  by  three  collinear  sites.  Then  these  sites  don’t  define  a  circle,  nor  a 
circle  event. 

We  conclude  that  the  above  algorithm  handles  degenerate  cases  correctly. 


Theorem  7.10  The  Voronoi  diagram  of  a  set  of  n  point  sites  in  the  plane  can  be 
computed  with  a  sweep  line  algorithm  in  0(n  log/;)  time  using  0(n)  storage. 
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7.3  Voronoi  Diagrams  of  Line  Segments 

The  Voronoi  diagram  can  also  be  defined  for  objects  other  than  points.  The 
distance  from  a  point  in  the  plane  to  an  object  is  then  measured  to  the  closest 
point  on  the  object.  Whereas  the  bisector  of  two  points  is  simply  a  line,  the 
bisector  of  two  disjoint  line  segments  has  a  more  complex  shape.  It  consists  of 
up  to  seven  parts,  where  each  part  is  either  a  line  segment  or  a  parabolic  arc. 
Parabolic  arcs  occur  if  the  closest  point  of  one  line  segment  is  an  endpoint  and 
the  closest  point  of  the  other  line  segment  is  in  its  interior.  In  all  other  cases  the 
bisector  part  is  straight.  Although  bisectors  and  therefore  the  Voronoi  diagram 
are  somewhat  more  complex,  the  number  of  vertices,  edges,  and  faces  in  the 
Voronoi  diagram  of  n  disjoint  line  segments  is  still  only  0(n). 

Assume  for  a  moment  that  we  allow  the  line  segments  to  be  non-crossing, 
that  is,  we  allow  them  to  share  endpoints.  Then  a  whole  region  of  the  plane  can 
be  equally  close  to  two  line  segments  via  their  common  endpoint,  and  bisectors 
are  not  even  curves  anymore.  To  avoid  the  complications  that  arise  in  defining 
and  computing  Voronoi  diagrams  of  line  segments  that  share  endpoints,  we 
will  simply  assume  here  that  all  line  segments  are  strictly  disjoint.  In  many 
applications  we  can  simply  shorten  the  line  segments  very  slightly  to  obtain 
disjoint  line  segments. 

The  sweep  line  algorithm  for  points  can  be  adapted  to  the  case  of  line 
segment  sites.  Let  S  =  {si, . . . , sn }  be  a  set  of  n  disjoint  line  segments.  We 
call  the  segments  of  S  sites  as  before,  and  use  the  terms  site  endpoint  and  site 
interior  in  the  following  description. 


Figure  7.5 

The  beach  line  for  a  set  of  line  segment 
sites.  The  breakpoints  trace  the  dashed 
arcs,  which  include  the  Voronoi  edges 


Recall  that  our  algorithm  for  point  sites  maintained  a  beach  line:  a  piecewise 
parabolic  ^-monotone  curve  such  that,  for  points  on  the  curve,  the  distance  to 
the  closest  site  above  the  sweep  line  is  equal  to  the  distance  to  the  sweep  line. 
What  does  the  beach  line  look  like  when  the  sites  are  segments?  First  we  note 
that  a  line  segment  site  may  be  partially  above  and  partially  below  the  sweep 
line.  When  defining  the  beach  line,  we  consider  only  those  parts  of  the  sites  that 
are  above  the  sweep  line.  Hence,  for  a  given  position  of  the  sweep  line  t,  the 
beach  line  consists  of  those  points  such  that  the  distance  to  the  closest  portion 
of  a  site  above  l  is  equal  to  the  distance  to  t.  This  means  that  the  beach  line 
now  consists  of  parabolic  arcs  and  straight  line  segments.  A  parabolic  arc  arises 
when  that  part  of  the  beach  line  is  closest  to  a  site  endpoint,  and  a  straight  line 
segment  arises  when  that  part  of  the  beach  line  is  closest  to  a  site  interior.  Note 
that  if  a  site  interior  intersects  l,  then  the  beach  line  will  have  two  straight  line 
segments  ending  at  the  intersection — see  site  S2  in  Figure  7.5. 
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The  breakpoints  between  parabolic  arcs  and  straight  segments  on  the  beach  Section  7.3 
line  arise  in  several  different  ways.  Figure  7.5  illustrates  this.  Assume  any  voronoi  diagrams  of  line 

position  l  of  the  sweep  line  during  the  downward  sweep  to  analyze  the  types  of  segments 

breakpoint: 

■  If  a  point  p  is  closest  to  two  site  endpoints  while  being  equidistant  from 
them  and  l,  then  p  is  a  breakpoint  that  traces  a  line  segment  (as  in  the  point 
site  case). 

■  If  a  point  p  is  closest  to  two  site  interiors  while  being  equidistant  from  them 
and  £,  then  p  is  a  breakpoint  that  traces  a  line  segment. 

■  If  a  point  p  is  closest  to  a  site  endpoint  and  a  site  interior  of  different  sites 
while  being  equidistant  from  them  and  £,  then  p  is  a  breakpoint  that  traces  a 
parabolic  arc. 

■  If  a  point  p  is  closest  to  a  site  endpoint,  the  shortest  distance  is  realized  by  a 
segment  that  is  perpendicular  to  the  line  segment  site,  and  p  has  the  same 
distance  from  l,  then  p  is  a  breakpoint  that  traces  a  line  segment. 

■  If  a  site  interior  intersects  the  sweep  line,  then  the  intersection  is  a  breakpoint 
that  traces  a  line  segment  (the  site  interior). 


In  the  fourth  and  fifth  cases,  the  breakpoint  does  not  actually  trace  an  arc  of  the 
Voronoi  diagram,  because  only  one  site  is  involved.  For  the  proper  operation  of 
the  algorithm,  dealing  with  such  breakpoints  and  corresponding  events  is  still 
necessary. 

As  in  the  sweep  line  algorithm  for  point  sites,  we  again  have  site  events  and 
circle  events.  A  site  event  occurs  when  the  sweep  line  reaches  a  site  endpoint. 
Obviously,  site  events  at  upper  endpoints  should  be  handled  differently  from 
site  events  at  lower  endpoints.  At  an  upper  endpoint,  an  arc  of  the  beach  line  is 
split  into  two,  and  in  between,  four  new  arcs  appear.  The  breakpoints  between 
these  four  arcs  are  of  the  last  two  types.  At  a  lower  endpoint,  the  breakpoint 
that  is  the  intersection  of  the  site  interior  with  the  sweep  line  is  replaced  by  two 
breakpoints  of  the  fourth  type,  with  a  parabolic  arc  in  between  (for  the  newly 
discovered  site  endpoint). 

Similarly,  there  are  several  types  of  circle  event.  They  all  correspond  to  the 
disappearance  of  an  arc  of  the  beach  line,  and  they  occur  when  the  sweep  line 
reaches  the  bottom  of  an  empty  circle  that  is  defined  by  two  or  three  sites  above 
the  sweep  line.  The  centers  of  these  empty  circles  are  at  locations  where  two 
consecutive  breakpoints  will  meet.  Depending  on  the  types  of  the  breakpoints 
that  meet,  several  different  cases  can  be  distinguished  and  handled.  If  the  two 
breakpoints  are  of  any  of  the  first  three  types,  then  three  sites  are  involved.  If 
one  of  the  breakpoints  is  of  the  fourth  type,  then  only  two  sites  are  involved. 
Breakpoints  of  the  fifth  type  cannot  be  involved  for  disjoint  line  segments. 

Notice  that  the  Voronoi  diagram  that  the  algorithm  computes  is  a  subdivision 
with  straight  edges  and  parabolic  arcs.  Can  we  store  this  type  of  subdivision  in 
a  doubly-connected  edge  list?  This  is  indeed  possible,  and  the  structure  need 
not  even  be  adapted.  With  each  face,  we  store  the  corresponding  site,  so  for  any 
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Chapter  7 

VORONOI  DIAGRAMS 


Figure  7.6 

Voronoi  diagram  of  line  segments,  and 
start  and  end  positions  of  a  disc 
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half-edge  e  we  can  determine  the  two  sites  that  have  e  on  their  bisector  (using 
IncidentFace(e)  and  IncidentFace(Twin(e))).  Since  we  can  also  easily  find  the 
two  vertices  between  which  the  edge  lies  ( Origin(e )  and  Origin  (Twin  (e))),  we 
can  determine  the  shape  of  any  edge  in  constant  time. 

The  whole  sweep  line  algorithm  is  now  just  an  extension  of  the  one  for  point 
sites,  with  more  cases  to  be  distinguished  and  handled.  However,  the  algorithm 
still  has  only  0(n)  events,  and  each  can  be  handled  in  O(\ogn)  time. 

Theorem  7.11  The  Voronoi  diagram  of  a  set  of  n  disjoint  line  segment  sites  can 
be  computed  in  O(nlogn)  time  using  0(n)  storage. 

One  of  the  applications  of  the  Voronoi  diagram  for  line  segments  is  in 
motion  planning  (covered  more  extensively  in  Chapter  13).  Assume  that  a  set 
of  obstacles  is  given,  consisting  of  n  line  segments  in  total,  and  that  we  have 
a  robot  A.  We  assume  that  the  robot  can  move  freely  in  all  directions,  and  is 
approximated  well  by  an  enclosing  disc  D.  Suppose  that  we  wish  to  find  a 
collision-free  motion  from  one  location  of  the  robot  to  another,  or  to  decide  that 
none  exists. 

One  motion-planning  technique  is  called  retraction.  The  idea  of  retraction  is 
that  the  arcs  of  the  Voronoi  diagram  define  the  middle  between  the  line  segments, 
and  therefore  define  a  path  with  the  most  clearance.  So  a  path  over  the  arcs 
of  the  Voronoi  diagram  is  the  best  option  for  a  collision-free  path.  Figure  7.6 
shows  a  set  of  line  segments  inside  a  rectangle,  together  with  a  Voronoi  diagram 
of  the  line  segments  and  the  sides  of  the  rectangle. 


We  can  determine  a  collision-free  path  between  two  disc  positions  amidst  a 
set  of  line  segments  with  the  following  algorithm. 


Algorithm  RETRACTION^, qstan, qcnd,r)  Section  7.4 

Input.  A  set  S  :=  { .v  | , . . . ,  ,s„ }  of  disjoint  line  segments  in  the  plane,  and  two  farthest-point  voronoi 

discs  Dstart  and  Dead  centered  at  <r/start  and  <r/cnci  with  radius  r.  The  two  disc  diagrams 

positions  do  not  intersect  any  line  segment  of  5. 

Output.  A  path  that  connects  qsta rt  to  qe nd  such  that  no  disc  of  radius  r  with  its 
center  on  the  path  intersects  any  line  segment  of  S.  If  no  such  path  exists,  this 
is  reported. 

1 .  Compute  the  Voronoi  diagram  Vor(S)  of  5  inside  a  sufficiently  large  bound¬ 
ing  box. 

2.  Locate  the  cells  of  Vor(P)  that  contain  qstan  and  qenA. 

3.  Determine  the  point  pst art  on  Vor(S)  by  moving  gstart  away  from  the  nearest 
line  segment  in  S.  Similarly,  determine  the  point  penA  on  Vor(S)  by  moving 
<?cnd  away  from  the  nearest  line  segment  in  S.  Add  pstart  and  pend  as  vertices 
to  Vor(S),  splitting  the  arcs  on  which  they  lie  into  two. 

4.  Let  S  be  the  graph  corresponding  to  the  vertices  and  edges  of  the  Voronoi 
diagram.  Remove  all  edges  from  9  for  which  the  smallest  distance  to  the 
nearest  sites  is  smaller  than  or  equal  to  r. 

5.  Determine  with  depth-first  search  whether  a  path  exists  from  pstart  to  penA 
in  9-  If  so,  report  the  line  segment  from  qslan  to  pstar t,  the  path  in  9  from 
/"start  to  Pend-  and  the  line  segment  from  pen( j  to  qtnA  as  the  path.  Otherwise, 
report  that  no  path  exists. 

The  line  segment  connecting  qstan  to  pstart  cannot  give  a  collision,  because  the 
disc  only  moves  further  away  from  the  nearest  obstacle.  Similarly,  the  line 
segment  between  penA  and  qenA  is  collision-free.  For  any  two  discs  centered  on 
the  Voronoi  diagram,  a  collision-free  path  between  them  exists  on  the  Voronoi 
diagram  if  and  only  if  such  a  path  exists  at  all.  Hence,  for  a  disc-shaped  robot,  a 
path  is  found  if  one  exists. 

Theorem  7.12  Given  n  disjoint  line  segment  obstacles  and  a  disc-shaped  robot, 
the  existence  of  a  collision-free  path  between  two  positions  of  the  robot  can  be 
determined  in  0(n  log/;)  time  using  0(n)  storage. 


7.4  Farthest-Point  Voronoi  Diagrams 

We  now  continue  with  a  different  application  where  Voronoi  diagrams  are 
needed.  When  objects  are  manufactured,  slight  deviations  in  the  shapes  of 
the  objects  will  occur.  When  the  objects  need  to  be  perfectly  round,  the  man¬ 
ufactured  objects  are  tested  for  their  roundness.  This  is  done  by  coordinate 
measurement  machines,  which  sample  points  on  the  surface  of  the  object.  As¬ 
sume  that  we  have  constructed  a  disc,  and  wish  to  determine  its  roundness.  The 
machine  gives  us  a  set  P  of  points  in  the  plane  that  lie  nearly  on  a  circle.  The 
roundness  of  a  set  of  points  is  defined  as  the  width  of  the  smallest-width  annulus 
that  contains  the  points.  An  annulus  is  the  region  between  two  concentric  circles, 
and  its  width  is  the  difference  between  the  radii  of  those  circles. 

The  smallest-width  annulus  must  of  course  have  some  points  of  the  set  P  on 
its  bounding  circles.  Let  us  call  the  outer  circle  Couter  and  the  inner  circle  Cjnner- 
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Chapter  7  Clearly,  there  must  be  at  least  one  point  on  Couler,  otherwise  we  can  reduce  the 

voronoi  diagrams  size  of  Couter,  and  at  least  one  point  on  Cinner.  otherwise  we  can  increase  the  size 

of  Cinner-  But  one  point  on  each  bounding  circle  cannot  give  us  a  smallest-width 
annulus  yet.  There  appear  to  be  three  different  cases,  each  with  a  total  of  four 
points  on  the  two  circles  (Figure  7.7): 

■  Corner  contains  at  least  three  points  of  P,  and  Cinncr  contains  at  least  one 
point  of  P. 

m  Corner  contains  at  least  one  point  of  P,  and  Cinncr  contains  at  least  three  points 
of  P. 

m  Couter  and  Cjnncr  both  contain  two  points  of  P. 


Figure  7.7 

Three  cases  of  the  smallest-width 
annulus 


If  Cinner  or  Couter  contains  fewer  points  than  listed  in  any  of  these  cases,  then 
we  can  always  find  an  annulus  with  a  smaller  width.  The  problem  of  finding  the 
smallest-width  annulus  enclosing  a  given  point  set  looks  similar  to  the  problem 
of  finding  the  smallest  disc  enclosing  a  point  set,  studied  in  Section  4.7.  The 
technique  we  used  for  the  smallest-disc  problem,  however,  does  not  work  for 
the  smallest-width  annulus:  the  property  that  an  added  point  that  does  not  lie  in 
the  optimal  annulus  so  far  must  always  lie  on  the  boundary  of  the  new  optimal 
annulus  does  not  hold. 

Finding  the  smallest-width  annulus  is  equivalent  to  finding  its  center  point. 
Once  the  center  point — let’s  call  it  q — is  fixed,  the  annulus  is  determined  by 
the  points  of  P  that  are  closest  to  and  farthest  from  q.  If  we  have  the  Voronoi 
diagram  of  P ,  then  the  closest  point  is  the  one  in  whose  cell  q  lies.  It  turns  out 
that  a  similar  structure  exists  for  the  farthest  point,  namely  the  farthest-point 
Voronoi  diagram.  This  divides  the  plane  into  cells  in  which  the  same  point 
of  P  is  the  farthest  point.  The  farthest-point  Voronoi  cell  of  a  point  p,  is  the 
intersection  of  n  —  1  half-planes,  just  as  for  a  standard  Voronoi  cell,  but  we  take 
the  “other  sides”  of  the  bisectors,  the  sides  where  p ,  is  farther  away.  Hence,  all 
cells  of  the  farthest-point  Voronoi  diagram  are  convex.  Not  every  point  of  P  has 
a  cell  in  the  farthest-point  Voronoi  diagram:  the  intersections  of  the  half-planes 
can  be  empty.  It  is  not  hard  to  see  that  for  any  point  in  the  plane,  its  farthest 
point  in  the  set  P  must  be  a  point  that  lies  on  the  convex  hull  of  P.  Therefore, 
a  point  that  lies  inside  the  convex  hull  cannot  have  a  cell  in  the  farthest-point 
Voronoi  diagram. 

Observation  7.13  Given  a  set  P  of  points  in  the  plane,  a  point  of  P  has  a  cell  in 
the  farthest-point  Voronoi  diagram  if  and  only  if  it  is  a  vertex  of  the  convex  hull 
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We  can  prove  more  properties  of  the  farthest-point  Voronoi  diagram.  Sup¬ 
pose  that  a  point  p,  £  P  lies  on  the  convex  hull,  and  let  q  be  some  point  in 
the  plane  for  which  /?,■  is  the  farthest  point.  Let  /(/?,.£/)  be  the  line  through  p, 
and  q.  Then  all  points  on  the  half-line  starting  at  q ,  contained  in  t(p,,q),  and 
not  containing  /?,,  must  also  be  in  the  farthest-point  Voronoi  cell  of  /;,.  This 
implies  that  all  cells  are  unbounded.  The  vertices  and  edges  of  the  farthest-point 
Voronoi  diagram  form  a  tree-like  structure  (in  the  graph  sense),  because  the 
diagram  is  connected  and  does  not  have  cycles.  A  cycle  would  imply  a  bounded 
cell. 

We  can  show  that  the  farthest-point  Voronoi  diagram  of  n  points  has  0(n) 
vertices,  edges,  and  cells  (see  also  Exercise  7.14).  There  is  another  interesting 
property:  the  center  of  the  smallest  enclosing  disc  (see  Section  4.7)  is  either 
a  vertex  of  the  farthest-point  Voronoi  diagram  or  the  midpoint  of  two  sites 
defining  an  edge  of  the  farthest-point  Voronoi  diagram.  In  the  former  case,  there 
are  three  equidistant  farthest  points,  and  in  the  latter  case,  two.  Clearly,  the 
center  of  the  smallest  enclosing  disc  cannot  have  just  one  point  that  is  farthest 
from  it. 

Since  the  farthest-point  Voronoi  diagram  has  half-infinite  edges,  we  cannot 
store  it  in  a  doubly-connected  edge  list,  but  we  can  adapt  the  structure  slightly 
to  deal  with  such  subdivisions.  We  use  a  special  vertex-like  record  as  the  origin 
of  each  half-edge  that  has  no  real  vertex  as  its  origin.  These  new  records  store 
the  direction  of  the  half-infinite  edge  instead  of  coordinates.  Furthermore,  half¬ 
edge  records  corresonding  to  half-infinite  edges  have  either  Next(e)  or  Prev(e ) 
undefined.  We  shall  still  use  the  term  “doubly-connected  edge  list’’  for  this 
adapted  version. 

We  now  present  an  algorithm  to  compute  the  farthest-point  Voronoi  diagram 
of  a  set  P  of  n  points  in  the  plane.  First,  we  compute  the  convex  hull  of  P,  take 
its  vertices,  and  put  them  in  random  order.  Let  this  random  order  be  p\ , . . .  ,ph- 
We  remove  the  points  ph,..., pi  one  by  one  from  the  cyclic  order,  and  when 
removing  /?,,  store  its  clockwise  neighbor  cw(pj)  and  counterclockwise  neighbor 
ccw(pj)  at  the  time  of  removal.  After  a  point  has  been  removed,  it  cannot  be 
the  clockwise  or  counterclockwise  neighbor  anymore  of  points  removed  later. 


Section  7.4 

FARTHEST-POINT  VORONOI 
DIAGRAMS 


\ 


Figure  7.8 

Addition  of  a  point  p,  to  the 
farthest-point  Voronoi  diagram  of 


We  compute  the  farthest-point  Voronoi  diagram  of  pi,P2,Pi  to  initialize 
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Chapter  7 

VORONOI  DIAGRAMS 


the  incremental  construction.  Then  we  insert  the  remaining  points  p4,...,ph 
while  constructing  the  farthest-point  Voronoi  diagram.  To  be  able  to  add  the 
farthest-point  Voronoi  cell  of  /?,  efficiently,  given  the  farthest-point  Voronoi 
diagram  of  {p\, . . .  .p,-\ } ,  we  maintain  a  pointer  for  each  point  p /,  1  <  j  <  U 
to  the  half-infinite  half-edge  of  the  doubly-connected  edge  list  that  is  most 
counterclockwise  in  a  traversal  of  the  boundary  of  the  farthest-point  Voronoi 
cell  of  pj. 

We  now  look  at  the  addition  of  the  cell  of  p,  in  more  detail,  see  Figure  7.8. 
The  cell  will  come  “in  between”  the  cells  of  cw(pj)  and  ccw(pi).  Just  before 
Pi  is  added,  cw(p,)  and  ccw(pj)  are  each  other’s  neighbors  on  the  convex  hull 
of  {pi,. . .  ,Pi~  i },  so  their  cells  are  separated  by  a  half-infinite  edge  that  is  part 
of  their  bisector.  The  point  ccw(pi)  has  a  pointer  to  this  edge.  The  bisector 
of  pj  and  ccw(pi)  will  give  a  new  half-infinite  edge  that  lies  in  the  farthest- 
point  Voronoi  cell  of  ccw(  /?,■),  and  is  part  of  the  boundary  of  the  farthest-point 
Voronoi  cell  of  pj.  We  traverse  the  cell  of  ccw(pi)  in  the  clockwise  direction 
to  see  which  edge  the  bisector  intersects.  On  the  other  side  of  this  edge  is 
the  farthest-point  Voronoi  cell  of  another  point  pj  from  {p\, . . . ,  i },  and  the 
bisector  of  pj  and  p,  will  also  give  an  edge  of  the  farthest-point  Voronoi  cell 
of  p\.  We  again  traverse  the  cell  of  p:  in  the  clockwise  direction  to  determine 
where  the  other  insertion  of  the  cell  boundary  and  the  bisector  is  located.  By 
tracing  cell  boundaries  in  clockwise  order,  we  trace  the  farthest-point  Voronoi 
cell  in  counterclockwise  order.  The  last  bisector  that  we  will  find  is  with  cw(pf), 
and  it  will  give  a  new  half-infinite  edge  in  the  farthest-point  Voronoi  diagram. 
All  new  edges  found  are  added  to  the  doubly-connected  edge  list  representation, 
after  which  all  edges  that  lie  inside  the  farthest-point  Voronoi  cell  of  p,  are 
removed.  They  are  no  longer  valid  edges  of  the  farthest-point  Voronoi  diagram 

of  {pi,...,  Pi}. 

In  short,  the  insertion  of  the  next  farthest-point  Voronoi  cell  is  done  by 
tracing  the  new  cell  with  the  help  of  the  existing  diagram,  adding  the  new  edges, 
and  removing  the  edges  that  have  become  obsolete. 


Theorem  7.14  Given  a  set  of  n  points  in  the  plane,  its  farthest-point  Voronoi 
diagram  can  be  computed  in  0(n  log/;)  expected  time  using  0(n)  storage. 

Proof.  It  takes  0(n  log/;)  time  to  compute  the  h  points  on  the  convex  hull  in 
counterclockwise  order.  The  farthest-point  Voronoi  diagram  actually  takes  only 
0(h)  expected  time  to  construct  after  we  have  the  points  on  the  convex  hull 
in  sorted  order.  To  see  this,  we  apply  backwards  analysis.  We  consider  the 
situation  after  the  insertion  of  the  cell  of  We  observe  that  if  the  cell  of  /?,-  has 
k  edges  on  its  boundary,  then  the  traversal  performed  to  trace  this  cell  visited 
k  cells  in  the  farthest-point  Voronoi  diagram  of  {pi,. . . , Pi  - 1  } ,  and  visited  at 
most  4k  —  6  boundary  edges  of  these  cells  in  total. 

The  farthest-point  Voronoi  diagram  of  {pi ,... ,/?,}  has  at  most  2 i  —  3  edges 
(see  Exercise  7.14),  each  used  by  two  cells.  Since  every  point  of  {pi, . . . ,/?,} 
has  the  same  probability  of  having  been  the  last  one  added,  the  expected  size 
of  the  cell  of  /;,  is  less  than  four.  Hence,  the  expected  time  needed  for  each 
insertion  is  0(1),  and  the  algorithm  runs  in  0(h)  expected  time.  0 


Now  we  return  to  the  problem  of  computing  the  smallest-width  annulus. 
Suppose  that  the  smallest-width  annulus  is  such  that  Cinner  contains  at  least  three 
points  of  P.  Then  its  center  is  a  vertex  of  the  normal  Voronoi  diagram  of  P. 
Similarly,  if  the  smallest-width  annulus  is  such  that  Couter  contains  at  least  three 
points  of  P,  its  center  is  a  vertex  of  the  farthest-point  Voronoi  diagram  of  P. 
Finally,  if  the  smallest-width  annulus  is  such  that  Cinner  and  Couter  both  contain 
two  points  of  P,  then  its  center  must  lie  on  an  edge  of  the  Voronoi  diagram  and 
on  an  edge  of  the  farthest-point  Voronoi  diagram  simultaneously.  This  means 
that  we  can  obtain  a  reasonably  small  set  of  points  that  must  contain  the  center 
of  a  smallest-width  annulus. 

To  do  this,  we  generate  the  vertices  of  the  overlay  of  the  Voronoi  diagram 
and  the  farthest-point  Voronoi  diagram.  The  vertices  of  the  overlay  are  exactly 
the  candidate  centers  of  the  smallest-width  annulus,  covering  all  three  cases.  We 
don’t  really  need  to  compute  the  overlay  itself.  Once  we  know  a  vertex  and  the 
four  points  that  determine  Cinner  and  Couter.  we  can  compute  the  smallest-width 
annulus  of  those  four  points  directly  in  (9(1)  time.  This  is  a  candidate  for  the 
smallest-width  annulus. 

The  whole  algorithm  to  compute  the  smallest-width  annulus  of  a  set  P  of  n 
points  in  the  plane  is  as  follows.  Compute  the  Voronoi  diagram  and  the  farthest- 
point  Voronoi  diagram  of  P.  For  each  vertex  of  the  farthest-point  Voronoi 
diagram,  determine  the  point  of  P  that  is  closest.  For  each  vertex  of  the  normal 
Voronoi  diagram,  determine  the  point  of  P  that  is  farthest.  This  gives  us  0{n) 
sets  of  four  points  that  define  the  candidate  annuli  in  the  first  and  second  cases. 
Next,  for  every  pair  of  edges,  one  from  each  of  the  diagrams,  test  if  they  intersect. 
If  so,  we  have  another  set  of  four  points  that  forms  a  candidate  annulus.  For 
all  candidates  of  all  three  types,  choose  the  one  that  gives  the  smallest-width 
annulus  as  the  solution. 
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Theorem  7.15  Given  a  setP  ofn  points  in  the  plane,  the  smallest-width  annulus 
(and  the  roundness)  can  be  determined  in  0{n2)  time  using  0{n)  storage. 


7.5  Notes  and  Comments 

Although  it  is  beyond  the  scope  of  this  book  to  give  an  extensive  survey  of 
the  history  of  Voronoi  diagrams  it  is  appropriate  to  make  a  few  historical 
remarks.  Voronoi  diagrams  are  often  attributed  to  Dirichlet  [148] — hence  the 
name  Dirichlet  tessellations  that  is  sometimes  used — and  Voronoi  [379,  380]. 

They  can  be  found  in  Descartes’s  treatment  of  cosmic  fragmentation  in  Part 
III  of  his  Principia  Philosophiae,  published  in  1644.  In  the  twentieth  century, 
the  Voronoi  diagram  was  rediscovered  several  times.  In  biology  this  even 
happened  twice  in  a  very  short  period.  In  1965  Brown  [75]  studied  the  intensity 
of  trees  in  a  forest.  He  defined  the  area  potentially  available  to  a  tree,  which 
was  in  fact  the  Voronoi  cell  of  that  tree.  One  year  later  Mead  [272]  used  the 
same  concept  for  plants,  calling  the  Voronoi  cells  plant  polygons.  Now,  there 

is  an  impressive  amount  of  literature  concerning  Voronoi  diagrams  and  their  _ 

applications  in  all  kinds  of  research  areas.  The  book  by  Okabe  et  al.  [297]  167 


Chapter  7  contains  an  ample  treatment  of  Voronoi  diagrams  and  their  applications.  We 
voronoi  diagrams  confine  ourselves  in  this  section  to  a  discussion  of  the  various  aspects  of  Voronoi 
diagrams  encountered  in  the  computational  geometry  literature. 


z  =  x2+y2 


In  this  chapter  we  have  proved  some  properties  of  the  Voronoi  diagram,  but  it 
has  many  more.  For  example,  if  one  connects  all  the  pairs  of  sites  whose  Voronoi 
cells  are  adjacent  then  the  resulting  set  of  segments  forms  a  triangulation  of  the 
point  set,  called  the  Delaunay  triangulation.  This  triangulation,  which  has  some 
very  nice  properties,  is  the  topic  of  Chapter  9. 

There  is  a  beautiful  connection  between  Voronoi  diagrams  and  convex 
polyhedra.  Consider  the  transformation  that  maps  a  point  p  =  (px,py)  in  E2  to 
the  non-vertical  plane  h(p)  :  z  =  2pxx  +  2pyy  —  (p2  +  p2)  in  E3.  Geometrically, 
h{p)  is  the  plane  that  is  tangent  to  the  unit  paraboloid  11 :  z  =  x2  +y2  at  the  point 
vertically  above  (px,py,  0).  For  a  set  P  of  point  sites  in  the  plane,  let  H(P)  be 
the  set  of  planes  that  are  the  images  of  the  sites  in  P.  Now  consider  the  convex 
polyhedron  T  that  is  the  intersection  of  all  positive  half-spaces  defined  by  the 
planes  in  H(P ),  that  is,  T  :=  fj/i eh(p)  h+>  where  h+  denotes  the  half-space  above 
h.  Surprisingly,  if  we  project  the  edges  and  vertices  of  the  polyhedron  vertically 
downwards  onto  the  Ay-plane,  we  get  the  Voronoi  diagram  of  P  [167].  See 
Chapter  1 1  for  a  more  extensive  description  of  this  transformation.  A  similar 
transformation  exists  for  the  farthest-point  Voronoi  diagram. 


We  have  studied  Voronoi  diagrams  in  their  most  basic  setting,  namely  for  a  set 
of  point  sites  in  the  Euclidean  plane.  The  first  optimal  O(nlogn)  time  algorithm 
for  this  case  was  a  divide-and-conquer  algorithm  presented  by  Shamos  and 
Hoey  [350];  since  then,  many  other  optimal  algorithms  have  been  developed. 
The  plane  sweep  algorithm  that  we  described  is  due  to  Fortune  [183].  Fortune’s 
original  description  of  the  algorithm  is  a  little  different  from  ours,  which  follows 
the  interpretation  of  the  algorithm  given  by  Guibas  and  Stolfi  [203]. 

Voronoi  diagrams  can  be  generalized  in  many  ways  [28,  297].  One  generaliza¬ 
tion  is  to  point  sets  in  higher-dimensional  spaces.  In  Erf,  the  maximum  combina¬ 
torial  complexity  of  the  Voronoi  diagram  of  a  set  of  n  point  sites  (the  maximum 
number  of  vertices,  edges,  and  so  on,  of  the  diagram)  is  0(«  ^/21 )  [239]  and  it 
can  be  computed  in  0{n  logn  +  « l^/2l )  optimal  time  [93,  133,  346].  The  fact 
that  the  dual  of  the  Voronoi  diagram  is  a  triangulation  of  the  set  of  sites,  and 
the  connection  between  Voronoi  diagrams  and  convex  polyhedra  as  discussed 
above  still  hold  in  higher  dimensions. 

Another  generalization  concerns  the  metric  that  is  used.  In  the  L\ -metric,  or 
Manhattan  metric,  the  distance  between  two  points  p  and  q  is  defined  as 

disti {p,q)  :=  \px~qx\  +  \py  ~qy\ , 

the  sum  of  the  absolute  differences  in  the  x-  and  y-coordinates.  In  a  Voronoi 
diagram  in  the  L\ -metric,  all  edges  are  horizontal,  vertical,  or  diagonal  (at  an 
angle  of  45°  to  the  coordinate  axes).  In  the  more  general  L;7-metric,  the  distance 
between  two  points  p  and  q  is  defined  as 
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distp{p,q)  :=  ^/\px-qx\r  +  \py-qy\P 


Note  that  the  Lo-metric  is  simply  the  Euclidean  metric.  There  are  several  papers  Section  7.5 

dealing  with  Voronoi  diagrams  in  these  metrics  [118,  248,  252].  One  can  also  notes  and  comments 

define  a  distance  function  by  assigning  a  weight  to  each  site.  Now  the  distance 

from  a  site  to  a  point  is  the  Euclidean  distance  to  the  point,  plus  its  additive 

weight.  The  resulting  diagrams  are  called  weighted  Voronoi  diagrams  [183]. 

The  weight  can  also  be  used  to  define  the  distance  from  a  site  to  a  point  as  the 
Euclidean  distance  times  the  weight.  Diagrams  based  on  this  multiplicatively 
weighted  distance  are  also  called  weighted  Voronoi  diagrams  [29].  Power 
diagrams  [25,  26,  27,  30]  are  another  generalization  of  Voronoi  diagrams  where 
a  different  distance  function  is  used.  It  is  even  possible  to  drop  the  distance 
function  altogether  and  define  the  Voronoi  diagram  in  terms  only  of  bisectors 
between  pairs  of  sites.  Such  diagrams  are  called  abstract  Voronoi  diagrams  [240, 

241,  242,  274], 


Other  generalizations  concern  the  shape  of  the  sites.  We  have  seen  the  Voronoi 
diagram  of  a  set  of  disjoint  line  segments  in  this  chapter.  We  discussed  the 
application  of  this  diagram  to  motion  planning  using  the  retraction  technique; 
Chapter  13  discusses  motion  planning  in  general. 

An  important  special  case  of  the  Voronoi  diagram  of  line  segments  is  the 
Voronoi  diagram  of  the  edges  of  a  simple  polygon,  interior  to  the  polygon  itself. 
Since  the  edges  share  endpoints,  there  can  be  whole  regions  inside  the  polygon 
where  two  edges  are  equally  close.  This  occurs  at  every  reflex  vertex  of  the 
polygon.  The  Voronoi  diagram  is  the  subdivision  of  the  interior  of  the  polygon 
into  faces  where  one  or  two  edges  are  the  closest.  This  Voronoi  diagram  is 
also  known  as  the  medial  axis  or  skeleton,  and  it  has  applications  in  shape 
analysis  [366,  377].  The  medial  axis  can  be  computed  in  time  linear  in  the 
number  of  edges  of  the  polygon  [123]. 

Instead  of  partitioning  the  space  into  regions  according  to  the  closest  sites, 
one  can  also  partition  it  according  to  the  k  closest  sites,  for  some  1  ^  k  ^  n  —  1. 
The  diagrams  obtained  in  this  way  are  called  higher-order  Voronoi  diagrams, 
and,  forgiven  k ,  the  diagram  is  called  the  order-/:  Voronoi  diagram  [6,  31, 70, 98]. 
Note  that  the  order- 1  Voronoi  diagram  is  nothing  more  than  the  standard  Voronoi 
diagram.  The  order-  (n  —  1)  Voronoi  diagram  is  the  farthest-point  Voronoi 
diagram,  because  the  Voronoi  cell  of  a  point  pi  is  now  the  region  of  points  for 
which  pi  is  the  farthest  site.  The  maximum  complexity  of  the  order-/:  Voronoi 
diagram  of  a  set  of  n  point  sites  in  the  plane  is  &(k(n  —  k))  [249].  Currently 
the  best  known  algorithms  for  computing  the  order-/:  Voronoi  diagram  run  in 
O(«log3  n  +  nk)  time  [6]  and  in  O(n\ogn  +  nk2cl°s  k)  time  [326],  where  c  is  a 
constant. 


The  farthest-point  Voronoi  diagram  takes  0{n\ogn)  time  to  compute,  but  if 
the  points  are  in  convex  position  and  are  given  in  the  order  along  the  convex 
hull,  then  there  exists  a  simple  0(n )  expected-time  algorithm  [116],  given  in 
this  chapter,  and  also  an  0(n)  time  deterministic  algorithm  [11].  Testing  the 
roundness  of  an  object  or  set  of  points  is  a  problem  that  arises  in  metrology, 

the  science  of  measurement.  Several  definitions  of  roundness  exist,  the  one  _ 

used  in  this  chapter  being  the  most  widely  accepted  one.  A  quadratic-time  169 


Chapter  7  algorithm  for  the  roundness  problem  was  given  by  Ebarra  et  al.  [155],  A  com- 
voronoi  diagrams  plex,  subquadratic-time  algorithm  was  suggested  by  Agarwal  and  Sharir  [9].  In 
special  cases  that  correspond  to  point  sets  that  may  occur  in  practice,  linear-time 
or  near-linear-time  algorithms  exist  [52,  142,  187],  A  survey  of  computational 
metrology  has  been  given  by  Yap  and  Chang  [396], 


7.6  Exercises 

7.1  Prove  that  for  any  n  >  3  there  is  a  set  of  n  point  sites  in  the  plane  such 
that  one  of  the  cells  of  Vor(P)  has  n  —  1  vertices. 

7.2  Show  that  Theorem  7.3  implies  that  the  average  number  of  vertices  of  a 
Voronoi  cell  is  less  than  six. 

7.3  Show  that  fl(nlogn)  is  a  lower  bound  for  computing  Voronoi  diagrams 
by  reducing  the  sorting  problem  to  the  problem  of  computing  Voronoi 
diagrams.  You  can  assume  that  the  Voronoi  diagram  algorithm  should 
be  able  to  compute  for  every  vertex  of  the  Voronoi  diagram  its  incident 
edges  in  cyclic  order  around  the  vertex. 

7.4  Prove  that  the  breakpoints  of  the  beach  line,  as  defined  in  Section  7.2, 
trace  out  the  edges  of  the  Voronoi  diagram  while  the  sweep  line  moves 
from  top  to  bottom. 

7.5  Give  an  example  where  the  parabola  defined  by  some  site  pt  contributes 
more  than  one  arc  to  the  beach  line.  Can  you  give  an  example  where  it 
contributes  a  linear  number  of  arcs? 

7.6  Give  an  example  of  six  sites  such  that  the  plane  sweep  algorithm  encoun¬ 
ters  the  six  site  events  before  any  of  the  circle  events.  The  sites  should  lie 
in  general  position:  no  three  sites  on  a  line  and  no  four  sites  on  a  circle. 

7.7  Do  the  breakpoints  of  the  beach  line  always  move  downwards  when  the 
sweep  line  moves  downwards?  Prove  this  or  give  a  counterexample. 

7.8  Write  a  procedure  to  compute  a  big  enough  bounding  box  from  the 
incomplete  doubly-connected  edge  list  and  the  tree  T  after  the  sweep  is 
completed.  The  box  should  contain  all  sites  and  all  Voronoi  vertices. 

7.9  Write  a  procedure  to  add  all  cell  records  and  the  corresponding  pointers 
to  the  incomplete  doubly-connected  edge  list  after  the  bounding  box  has 
been  added.  That  is,  fill  in  the  details  of  line  8  of  Algorithm  VoronoiDi- 
AGRAM. 

7.10  Let  P  be  a  set  of  n  points  in  the  plane.  Give  an  O(n\ogn)  time  algorithm 

_  to  find  two  points  in  P  that  are  closest  together.  Show  that  your  algorithm 

170  is  correct. 


7.1 1  Let  P  be  a  set  of  n  points  in  the  plane.  Give  an  O(nlogn)  time  algorithm  Section  7.6 

to  find  for  each  point  p  in  P  another  point  in  P  that  is  closest  to  it.  exercises 

7.12  Let  the  Voronoi  diagram  of  a  point  set  P  be  stored  in  a  doubly-connected 
edge  list  inside  a  bounding  box.  Give  an  algorithm  to  compute  all  points 

of  P  that  lie  on  the  boundary  of  the  convex  hull  of  P  in  time  linear  in  the  * 

output  size.  Assume  that  your  algorithm  receives  as  its  input  a  pointer  to 

the  record  of  some  half-edge  whose  origin  lies  on  the  bounding  box.  J 

7.13  For  each  of  the  ten  breakpoints  shown  in  Figure  7.5,  determine  which  of 
the  five  types  it  corresponds  to. 

7.14  Show  that  the  farthest  point  Voronoi  diagram  on  n  points  in  the  plane 
has  at  most  2 n  —  3  (bounded  or  unbounded)  edges.  Also  give  an  exact 
bound  on  the  maximum  number  of  vertices  in  the  farthest  point  Voronoi 
diagram. 

7.15  Show  that  the  smallest  width  annulus  cannot  be  constructed  with  ran¬ 
domized  incremental  construction.  To  this  end,  show  that  a  point  pi 
can  be  added  to  a  set  Pi- \  that  does  not  lie  in  the  minimum  width  annu¬ 
lus,  but  does  not  lie  on  the  boundary  of  the  smallest  width  annulus  of 
Pi  :=  Pi-i  D  {pi}. 

7.16  Show  that  for  some  set  P  of  n  points,  there  can  be  iliir)  intersections 
between  the  edges  of  the  Voronoi  diagram  and  the  farthest  site  Voronoi 
diagram. 

7.17  Show  that  if  there  are  only  O(n)  intersections  between  the  edges  of  the 
Voronoi  diagram  and  the  farthest  site  Voronoi  diagram,  then  the  smallest 
width  annulus  can  be  computed  in  0{n\ogn)  expected  time. 

7.18*  In  the  Voronoi  assignment  model  the  goods  or  services  that  the  consumers 
want  to  acquire  have  the  same  market  price  at  every  site.  Suppose  this  is 
not  the  case,  and  that  the  price  of  the  good  at  site  /?,•  is  w,-.  The  trading  areas 
of  the  sites  now  correspond  to  the  cells  in  the  weighted  Voronoi  diagram 
of  the  sites  (see  Section  7.5),  where  site  pi  has  an  additive  weight  vv,. 

Generalize  the  sweep  line  algorithm  of  Section  7.2  to  this  case. 

7.19*  Suppose  that  we  are  given  a  subdivision  of  the  plane  into  n  convex  regions. 

We  suspect  that  this  subdivision  is  a  Voronoi  diagram,  but  we  do  not  know 
the  sites.  Develop  an  algorithm  that  finds  a  set  of  n  point  sites  whose 
Voronoi  diagram  is  exactly  the  given  subdivision,  if  such  a  set  exists. 
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8  Arrangements  and  Duality 

Supersampling  in  Ray  Tracing 


Computer  generated  images  of  3-dimensional  scenes  are  becoming  more  and 
more  realistic.  Nowadays,  they  can  hardly  be  distinguished  from  photographs. 
A  technique  that  has  played  an  important  role  in  this  development  is  ray  tracing. 
It  works  as  follows. 


Figure  8. 1 

Determining  visible  objects  using  ray 
tracing 


The  screen  of  a  computer  monitor  is  composed  of  many  small  dots,  called 
pixels.  A  nice  screen  consists  of,  say,  1280  by  1024  pixels.  Suppose  that  we 
are  given  a  3-dimensional  scene  consisting  of  several  objects,  a  light  source, 
and  a  view  point.  Generating  an  image  of  this  scene — also  called  rendering  the 
scene — amounts  to  determining  for  every  pixel  on  the  screen  which  object  is 
visible  at  that  pixel,  and  determining  the  intensity  of  light  that  the  object  emits 
in  the  direction  of  the  view  point  at  that  particular  point.  Let’s  first  look  at  the 
first  task,  determining  the  visible  object  at  each  pixel.  Ray  tracing  performs  this 
task  by  shooting  a  ray  through  each  pixel,  as  in  Figure  8.1.  The  first  object  that 
is  hit  is  the  one  that  is  visible  at  the  pixel.  Once  the  visible  object  is  determined, 
one  has  to  compute  the  intensity  of  light  emitted  by  the  object  at  the  visible 
point.  Here  we  have  to  take  into  account  how  much  light  that  point  receives 
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from  the  light  source,  either  directly  or  indirectly  via  reflections  on  other  objects. 
The  strength  of  ray  tracing  is  that  it  can  perform  this  second  task — which  is 
crucial  to  getting  a  realistic  image — quite  well.  In  this  chapter,  however,  we  are 
mainly  interested  in  the  first  part. 

There  is  one  issue  in  determining  the  visible  object  for  each  pixel  that  we 
have  swept  under  the  rug:  a  pixel  is  not  a  point,  but  a  small  square  area.  In 
general  this  is  not  a  problem.  Most  pixels  are  covered  completely  by  one  of  the 
objects,  and  shooting  a  ray  through  the  pixel  center  will  tell  us  which  object 
this  is.  But  problems  arise  near  the  edges  of  the  objects.  When  an  edge  of  an 
object  crosses  a  pixel,  the  object  may  cover  49%  of  the  pixel  area,  but  be  missed 
by  the  ray  through  the  pixel  center.  If,  on  the  other  hand,  the  object  covered 
51%  of  the  pixel  area,  it  would  be  hit  by  the  ray  and  we  would  falsely  assume 
that  the  whole  pixel  was  covered.  This  results  in  the  well-known  jaggies  in  the 
image.  This  artifact  is  diminished  if  we  had  not  just  two  categories,  “hit”  and 
“miss”,  but  also  categories  like  “49%  hit”.  Then  we  could  set  the  intensity  of 
the  pixel  to,  say,  0.49  times  the  object  intensity.  Or  if  there  is  more  than  one 
object  visible  inside  the  pixel,  we  could  make  the  pixel  intensity  a  mixture  of 
the  object  intensities. 

How  can  we  incorporate  such  different  pixel  intensities  in  our  ray  tracing 
scheme?  The  solution  is  to  shoot  more  than  one  ray  per  pixel.  If,  for  instance,  we 
shoot  100  rays  per  pixel  and  35  of  them  hit  an  object  then  we  expect  that  object 
to  be  visible  in  roughly  35%  of  the  pixel  area.  This  is  called  supersampling: 
instead  of  taking  one  sample  point  per  pixel,  we  take  many. 

How  should  we  distribute  the  rays  over  the  pixel  to  make  this  work?  An 
obvious  choice  would  be  to  distribute  them  regularly;  for  100  rays  this  would 
mean  having  a  regular  grid  of  10  by  10  sample  points  in  the  pixel.  Indeed,  if 
35  of  these  rays  hit  the  object  then  the  area  where  it  is  visible  cannot  be  far 
from  35%.  There  is,  however,  a  disadvantage  to  choosing  a  regular  sample 
pattern:  although  the  error  at  each  particular  pixel  will  be  small,  there  will  be  a 
certain  regularity  across  rows  (and  columns)  of  pixels.  Regularity  in  the  errors 
triggers  the  human  visual  system,  and  as  a  result  we  see  annoying  artifacts.  So 
choosing  a  regular  sample  pattern  isn’t  such  a  good  idea.  It’s  better  to  choose 
the  sample  points  in  a  somewhat  random  fashion.  Of  course  not  every  random 
pattern  works  equally  well;  we  still  want  the  sample  points  to  be  distributed  in 
such  a  way  that  the  number  of  hits  is  close  to  the  percentage  of  covered  area. 


Suppose  that  we  have  generated  a  random  set  of  sample  points.  We’d  like  to 
have  a  way  of  deciding  whether  this  set  is  good.  What  we  want  is  that  the 
difference  between  the  percentage  of  hits  for  an  object  and  the  percentage  of 
the  pixel  area  where  that  object  is  visible  is  small.  This  difference  is  called 
the  discrepancy  of  the  sample  set  with  respect  to  the  object.  Of  course  we  do 
not  know  in  advance  which  objects  will  be  visible  in  the  pixel,  so  we  have  to 
prepare  for  the  worst-case  scenario:  we  want  the  maximum  discrepancy  over  all 
possible  ways  that  an  object  can  be  visible  inside  the  pixel  to  be  small.  This  is 
called  the  discrepancy  of  the  sample  set ,  and  it  depends  on  the  type  of  objects 

_  that  are  present  in  the  scene.  So  formally  the  discrepancy  of  a  sample  set  is 

174  defined  with  respect  to  a  given  class  of  objects.  Based  on  the  discrepancy  of  the 


given  set  of  sample  points  we  can  decide  if  it  is  good  enough:  if  the  discrepancy  Section  8.1 

is  low  enough  we  decide  to  keep  it,  and  otherwise  we  generate  a  new  random  computing  the  discrepancy 

set.  For  this  we  need  an  algorithm  that  computes  the  discrepancy  of  a  given 

point  set. 


8.1  Computing  the  Discrepancy 

We  mentioned  above  that  the  discrepancy  of  a  point  set  is  defined  with  respect 
to  a  class  of  objects.  The  objects  that  we  must  consider  are  the  projections  of 
the  3-dimensional  objects  that  make  up  our  scene.  As  is  common  in  computer 
graphics,  we  assume  that  curved  objects  are  approximated  using  polygonal 
meshes.  So  the  2-dimensional  objects  that  we  must  consider  are  the  projections 
of  the  facets  of  polyhedra.  In  other  words,  we  are  interested  in  the  discrepancy 
with  respect  to  the  class  of  polygons.  In  general  scenes,  most  pixels  will  be 
crossed  by  at  most  one  edge  of  a  given  polygon,  unless  the  scene  consists 
of  many  polygons  that  are  extremely  thin  or  tiny.  If  a  pixel  is  intersected  by 
one  polygon  edge,  the  polygon  behaves  inside  the  pixel  like  a  half-plane.  The 
situation  that  a  pixel  is  intersected  by  more  polygon  edges  is  much  less  common. 
Also,  it  doesn’t  cause  regularity  in  the  error,  which  was  the  source  of  the 
annoying  artifacts.  Therefore  we  restrict  our  attention  to  half-plane  discrepancy. 

Let  U  :=  [0  :  1]  x  [0  :  1]  be  the  unit  square — U  models  a  pixel — and  let  S  be  a 
set  of  n  sample  points  in  U.  Let  Jf  denote  the  (infinite)  set  of  all  possible  closed 
half-planes.  The  continuous  measure  of  a  half-plane  h  £  TC  is  defined  as  the 
area  of  h  fl  U.  We  denote  it  by  Ll  (h).  For  a  half-plane  h  that  completely  covers 
U  we  have  /i(/z)  =  1,  for  instance.  The  discrete  measure  of  h,  which  we  denote 
as  Hs(h),  is  defined  as  the  fraction  of  the  sample  points  that  is  contained  in  h, 
so  Hs(h)  :=  card(Sn/i)/ card(S),  where  card(-)  denotes  the  cardinality  of  a  set. 
The  discrepancy  of  h  with  respect  to  the  sample  set  S.  denoted  as  A s(/i),  is  the 
absolute  difference  between  the  continuous  and  the  discrete  measure: 

A s(h)  :=  \p{h)  - ns(h)\. 

The  discrepancy  of  the  half-plane  in  the  margin  figure,  for  example,  is  |0.25  — 
0.3 1  =  0.05.  Finally,  the  half-plane  discrepancy  of  S  is  the  supremum  of  the 
discrepancies  over  all  possible  half-planes: 

AW(S)  :=  sup  A s(h). 

hew 

We  have  defined  what  it  is  that  we  want  to  compute.  Now  let’s  see  how  to 
compute  it. 

The  supremum  of  the  discrepancy  over  all  closed  half-planes  equals  the  max¬ 
imum  discrepancy  over  all  open  or  closed  half-planes.  To  get  the  search  for 
the  half-plane  with  maximum  discrepancy  started,  we  first  identify  a  finite  set 
of  candidate  half-planes.  It’s  always  a  good  idea  to  replace  an  infinite  set  of 
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Chapter  8  candidates  by  a  finite  set,  provided  the  latter  contains  the  ones  we’re  interested 

arrangements  and  duality  in.  So  the  finite  set  we  identify  must  contain  the  half-plane  that  has  maximum 

discrepancy.  We’ll  select  each  half-plane  that  locally  has  maximal  discrep¬ 
ancy.  This  means  that  if  we  translated  or  rotated  such  a  half-plane  slightly,  its 
discrepancy  would  decrease.  One  of  these  half-planes  with  locally  maximal 
discrepancy  will  be  the  one  that  gives  the  real  maximum. 

Any  half-plane  that  does  not  contain  a  point  of  S  on  its  boundary  can  be 
translated  slightly  such  that  its  continuous  measure  increases  while  its  discrete 
measure  remains  the  same.  A  slight  translation  in  the  opposite  direction  will 
decrease  the  continuous  measure  while  the  discrete  measure  remains  the  same. 
So  one  of  the  two  translations  results  in  an  increase  in  the  discrepancy.  Hence, 
the  half-plane  we  are  looking  for  has  a  point  of  S  on  its  boundary.  Now  consider 
a  half-plane  h  having  only  one  point  p  £  S  on  its  boundary.  Can  we  always 
rotate  h  around  p  such  that  the  discrepancy  increases?  In  other  words,  does 
the  half-plane  with  the  maximum  discrepancy  always  have  two  points  on  its 
boundary?  The  answer  is  no:  when  we  rotate  h  around  p  we  can  reach  a  local 
extremum  in  the  continuous  measure  function.  Suppose  this  local  extremum  is  a 
local  maximum.  Then  any  slight  rotation  will  decrease  the  continuous  measure. 
If  the  discrete  measure  is  smaller  than  the  continuous  measure  at  the  local 
maximum,  rotating  will  decrease  the  discrepancy.  Similarly,  any  slight  rotation 
at  a  local  minimum  of  the  continuous  measure  where  the  discrete  measure  is 
greater  than  the  continuous  measure  will  decrease  the  discrepancy.  Hence,  the 
maximum  discrepancy  could  very  well  be  achieved  at  such  an  extremum. 


Let’s  investigate  the  extrema  more  closely.  Let  p  :=  (px,Py)  be  a  point  in 
S.  For  0  ^  (j)  <  2n.  let  £p(<p)  be  the  line  through  p  that  makes  an  angle  (j) 
with  the  positive  x-axis.  Consider  the  continuous  measure  function  of  the  half¬ 
plane  initially  lying  above  £p((j>)\  we  denote  this  half-plane  by  hp((j)).  We  are 
interested  in  the  local  extrema  of  the  function  <j>  i— >  p  (hp(ty)).  When  (f>  increases 
from  0  to  2it,  the  line  £p ( (j) )  rotates  around  p.  First  of  all.  an  extremum  may 
occur  when  £p(<p)  sweeps  over  one  of  the  vertices  of  U.  This  happens  at  most 
eight  times.  In  between  two  such  events  £p(0)  intersects  two  fixed  edges  of 
U.  A  little  calculation  yields  the  continuous  measure  functions  for  the  various 
cases  that  occur.  For  example,  when  £p((j>)  intersects  the  top  and  left  boundary 
of  U  we  have 

H(hP{<l>))  =  \{\-py  +  pA&n<!))(px+l— ^). 

In  this  case  there  are  at  most  two  local  extrema.  The  continuous  measure 
function  is  similar  when  £p(<p)  intersects  two  other  edges  of  U,  so  we  may 
conclude  that  there  is  a  constant  number  of  local  extrema  per  point  p  £  S.  Thus 
the  total  number  of  candidate  half-planes  with  one  point  on  their  boundary  is 
0(n).  Moreover,  we  can  find  the  extrema  and  the  corresponding  half-planes  in 
0(1)  time  per  point.  We  have  proved  the  following  lemma. 

Lemma  8.1  Let  S  be  a  set  of  n  points  in  the  unit  square  U.  A  half-plane  h  that 
achieves  the  maximum  discrepancy  with  respect  to  S  is  of  one  of  the  following 
types: 
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(i)  h  contains  one  point  p  £  S  on  its  boundary.  Section  8.2 

(ii)  h  contains  two  or  more  points  of  S  on  its  boundary.  duality 

The  number  of  type  (i)  candidates  is  0(n),  and  they  can  be  found  in  0(n)  time. 

The  number  of  type  (ii)  candidates  is  clearly  quadratic.  Because  the  number 
of  type  (i)  candidates  is  much  smaller  than  this,  we  treat  them  in  a  brute-force 
way:  for  each  of  the  0(n)  half-planes  we  compute  their  continuous  measure  in 
constant  time,  and  their  discrete  measure  in  0(n)  time.  This  way  the  maximum 
of  the  discrepancies  of  these  half-planes  can  be  computed  in  0{n2)  time.  For 
the  type  (ii)  candidates  we  must  be  more  careful  when  computing  the  discrete 
measures.  For  this  we  need  some  new  techniques.  In  the  remainder  of  this 
chapter  we  introduce  these  techniques  and  we  show  how  to  use  them  to  compute 
all  discrete  measures  in  0(n2)  time.  We  can  then  compute  the  discrepancy 
of  these  half-planes  in  constant  time  per  half-plane,  and  take  the  maximum. 

Finally,  by  comparing  this  maximum  to  the  maximum  discrepancy  of  the  type 
(i)  candidates  we  find  the  discrepancy  of  S.  This  leads  to  the  following  theorem. 

Theorem  8.2  The  half-plane  discrepancy  of  a  set  S  of  n  points  in  the  unit  square 
can  be  computed  in  0(n2)  time. 


8.2  Duality 


A  point  in  the  plane  has  two  parameters:  its  x-coordinate  and  its  y-coordinate. 
A  (non-vertical)  line  in  the  plane  also  has  two  parameters:  its  slope  and  its 
intersection  with  the  y-axis.  Therefore  we  can  map  a  set  of  points  to  a  set  of 
lines,  and  vice  versa,  in  a  one-to-one  manner.  We  can  even  do  this  in  such  a  way 
that  certain  properties  of  the  set  of  points  translate  to  certain  other  properties 
for  the  set  of  lines.  For  instance,  three  points  on  a  line  become  three  lines 
through  a  point.  Several  different  mappings  that  achieve  this  are  possible;  they 


primal  plane 


dual  plane 


Figure  8.2 

An  example  of  duality 


are  called  duality  transforms.  The  image  of  an  object  under  a  duality  transform 
is  called  the  dual  of  the  object.  A  simple  duality  transform  is  the  following. 
Let  p  :=  (px-Py)  be  a  point  in  the  plane.  The  dual  of  p,  denoted  p* ,  is  the  line 
defined  as 


P*  ■=  (y  =  PxX~py). 


Ill 
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£*  :=  (m,—b). 

The  duality  transform  is  not  defined  for  vertical  lines.  In  most  cases  vertical 
lines  can  be  handled  separately,  so  this  is  not  a  problem.  Another  solution  is  to 
rotate  the  scene  so  that  there  are  no  vertical  lines. 

We  say  that  the  duality  transform  maps  objects  from  the  primal  plane  to  the 
dual  plane.  Certain  properties  that  hold  in  the  primal  plane  also  hold  in  the  dual 
plane: 

Observation  8.3  Let  p  be  a  point  in  the  plane  and  let  £  be  a  non-vertical  line  in 
the  plane.  The  duality  transform  o  i— >■  o'  has  the  following  properties, 
m  It  is  incidence  preserving:  p  £  £  if  and  only  if  £*  £  p*. 
m  It  is  order  preserving:  p  lies  above  £  if  and  only  if  £*  lies  above  p* . 

Figure  8.2  illustrates  these  properties.  The  three  points  p\ ,  p2,  and  p 3  lie  on 
the  line  £  in  the  primal  plane;  the  three  lines  pi*,  p2* ,  and  p-\ *  go  through  the 
point  £*  in  the  dual  plane.  The  point  p4  lies  above  the  line  £  in  the  primal  plane; 
the  point  £*  lies  above  the  line  p4*  in  the  dual  plane. 


The  duality  transform  can  also  be  applied  to  other  objects  than  points  and  lines. 
What  would  be  the  dual  of  a  line  segment  s  :=  ~pq,  for  example?  A  logical  choice 
for  s*  is  the  union  of  the  duals  of  all  points  on  .v.  What  we  get  is  an  infinite  set 
of  lines.  All  the  points  on  s  are  collinear,  so  all  the  dual  lines  pass  through  one 
point.  Their  union  forms  a  double  wedge,  which  is  bounded  by  the  duals  of  the 
endpoints  of  s.  The  lines  dual  to  the  endpoints  of define  two  double  wedges,  a 
left-right  wedge  and  a  top-bottom  wedge;  s*  is  the  left-right  wedge.  Figure  8.3 
shows  the  dual  of  a  segment  ,v.  It  also  shows  a  line  £  intersecting  s,  whose  dual 

primal  plane  dual  plane 


Figure  8.3 

The  dual  transform  applied  to  a  line 
segment 


£*  lies  in  s* .  This  is  not  a  coincidence:  any  line  that  intersects  s  must  have  either 
p  or  q  above  it  and  the  other  point  below  it,  so  the  dual  of  such  a  line  lies  in  s* 
by  the  order  preserving  property  of  the  dual  transform. 


The  dual  transform  defined  above  has  a  nice  geometric  interpretation.  Let  II 
denote  the  parabola  11 :  y  =  x2/2.  Let’s  first  look  at  the  dual  of  a  point  p  that 
lies  on  II.  The  derivative  of  II  at  p  is  px,  so  p*  has  the  same  slope  as  the  tangent 

_  line  of  II  at  p.  As  a  matter  of  fact,  the  dual  of  a  point  p  £  U  is  the  tangent  line 

178  at  p,  because  the  intersection  of  the  tangent  with  the  y-axis  is  (0,  —px/2).  Now 


suppose  that  a  point  q  does  not  lie  on  IX.  What  is  the  slope  of  q*l  Well,  any  two 
points  on  the  same  vertical  line  have  duals  with  equal  slope.  In  particular,  q*  is 
parallel  to  p* ,  where  p  is  the  point  that  lies  on  'll  and  has  the  same  x-coordinate 
as  q.  Let  q'  be  the  point  with  the  same  x-coordinate  as  q  (and  as  p)  such  that 
q'y  —  py  =  py  —  qy.  The  vertical  distance  between  the  duals  of  points  with  the 
same  x-coordinate  is  equal  to  the  difference  in  v-coordi  nates  of  these  points. 
Hence,  q*  is  the  line  through  q'  that  is  parallel  to  the  tangent  of  ll  at  p. 

When  you  think  about  duality  for  a  few  minutes  you  may  wonder  how  duality 
can  be  useful.  If  you  can  solve  a  problem  in  the  dual  plane,  you  could  have 
solved  it  in  the  primal  plane  as  well  by  mimicking  the  solution  to  the  dual 
problem  in  the  primal  plane.  After  all.  the  primal  and  dual  problems  are 
essentially  the  same.  Still,  transforming  a  problem  to  the  dual  plane  has  one 
important  advantage:  it  provides  a  new  perspective.  Looking  at  a  problem  from 
a  different  angle  can  give  the  insight  needed  to  solve  it. 

Let’s  see  what  happens  when  we  consider  the  discrepancy  problem  in  the 
dual  plane.  In  the  previous  section  we  were  left  with  the  following  problem: 
Given  a  set  S  of  n  points,  compute  the  discrete  measure  of  every  half-plane 
bounded  by  a  line  through  two  of  the  points.  When  we  dualize  the  set  S  of  points 
we  get  a  set  S*  :=  {p*  :  p  £  S}  of  lines.  Let  P(p,q)  denote  the  line  through  two 
points  p,q  £  S.  The  dual  of  this  line  is  the  intersection  point  of  the  two  lines 
p*,q*  £  S*.  Consider  the  open  half-plane  bounded  by  and  below  t(p.q).  The 
discrete  measure  of  this  half-plane  is  the  number  of  points  strictly  below  £(p,  q). 
This  means  that  in  the  dual  plane  we  are  interested  in  the  number  of  lines  strictly 
above  £(p,q)*.  For  the  closed  half-plane  below  Up-q)  we  must  also  take  the 
lines  through  £(p,q)*  into  account.  Similarly,  for  the  half-plane  bounded  by  and 
above  £(p,q)  we  are  interested  in  the  number  of  lines  below  £(p,q)*.  In  the  next 
section  we  study  sets  of  lines,  and  we  give  an  efficient  algorithm  to  compute  the 
number  of  lines  above  every  intersection  point,  through  every  intersection  point, 
and  below  every  intersection  point.  When  we  apply  this  algorithm  to  S*  we  get 
all  the  information  we  need  to  compute  the  discrete  measure  of  all  half-planes 
bounded  by  lines  through  two  points  in  S. 

There  is  one  thing  that  we  should  be  careful  about:  two  points  in  S  with  the 
same  x-coordinate  dualize  to  lines  with  the  same  slope.  So  the  line  through  these 
points  does  not  show  up  as  an  intersection  in  the  dual  plane.  This  makes  sense, 
because  the  dual  transform  is  undefined  for  vertical  lines.  In  our  application  this 
calls  for  an  additional  step.  For  every  vertical  line  through  at  least  two  points,  we 
must  determine  the  discrete  measures  of  the  corresponding  half-planes.  Since 
there  is  only  a  linear  number  of  vertical  lines  through  two  (or  more)  points  in  S , 
the  discrete  measures  for  these  lines  can  be  computed  in  a  brute-force  manner 
in  0(n2)  time  in  total. 
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8.3  Arrangements  of  Lines 

Let  L  be  a  set  of  n  lines  in  the  plane.  The  set  L  induces  a  subdivision  of  the  _ 

plane  that  consists  of  vertices,  edges,  and  faces.  Some  of  the  edges  and  faces  179 
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are  unbounded.  This  subdivision  is  usually  referred  to  as  the  arrangement 
induced  by  L ,  and  it  is  denoted  by  A(L).  An  arrangement  is  called  simple  if 
no  three  lines  pass  through  the  same  point  and  no  two  lines  are  parallel.  The 
( combinatorial )  complexity  of  an  arrangement  is  the  total  number  of  vertices, 
edges,  and  faces  of  the  arrangement.  Arrangements  of  lines  and  their  higher¬ 
dimensional  counterparts  occur  frequently  in  computational  geometry.  Often  a 
problem  that  is  defined  on  a  set  of  points  is  dualized  and  turned  into  a  problem 
on  arrangements.  This  is  done  because  the  structure  of  a  line  arrangement  is 
more  apparent  than  the  structure  of  a  point  set.  A  line  through  a  pair  of  points 
in  the  primal  plane,  for  instance,  becomes  a  vertex  in  the  dual  arrangement — a 
much  more  explicit  feature.  The  extra  structure  in  an  arrangement  does  not 
come  for  free:  constructing  a  full  arrangement  is  a  time-  and  storage-consuming 
task,  because  the  combinatorial  complexity  of  an  arrangement  is  high. 


Theorem  8.4  Let  L  be  a  set  ofn  lines  in  the  plane,  and  let  A(L)  be  the  arrange¬ 
ment  induced  by  L. 

(i)  The  number  of  vertices  ofA(L)  is  atmostnfi  —  l)/2. 

(ii)  The  number  of  edges  of  A(L)  is  at  most  n2. 

(Hi)  The  number  of  faces  of  A{L)  is  at  most  rr  /2  +  n/2  +  1 . 

Equality  holds  in  these  three  statements  if  and  only  if  A(L)  is  simple. 


Proof.  The  vertices  of  A(L)  are  the  intersection  points  of  pairs  of  lines  in  L. 
Hence,  there  are  at  most  n(n  —  l)/2  of  them.  This  number  of  vertices  is  achieved 
if  and  only  if  every  pair  of  lines  gives  rise  to  a  unique  intersection  point,  which 
happens  exactly  if  A(L)  is  simple. 

The  number  of  edges  on  a  line  is  one  more  than  the  number  of  vertices  on 
that  line.  The  latter  number  is  at  most  n  —  1,  so  the  number  of  edges  on  a  line 
is  bounded  by  n.  This  gives  at  most  ir  edges  in  total,  which  is  achieved  if  and 
only  if  A(L')  is  simple. 

To  bound  the  number  of  faces  of  A(L)  we  add  the  lines  one  by  one  and 
bound  the  increase  in  the  number  of  faces  at  each  step.  Let  L:=  .. ,  (n  \ . 

For  1  ^  i  ^  n,  define  L,  :  =  { /  | . . . . ,  } .  What  is  the  increase  in  the  number  of 
faces  when  we  add  and  go  from  A(Lj- 1)  to  A(L,)?  Every  edge  on  £;  splits  a 
face  of  A(Lj- 1)  into  two.  Hence,  the  number  of  faces  increase  by  the  number 
of  edges  of  A(L,_i )  on  f,-.  The  latter  number  is  at  most  i,  so  the  total  number  of 
faces  is  at  most 

n 

l  +  £i  =  n2/2  +  n/2+l. 
i=  1 

Again,  this  is  achieved  if  and  only  if  A{L)  is  simple.  0 


So  the  arrangement  A{L)  induced  by  a  set  L  of  lines  is  a  planar  subdivision 
of  at  most  quadratic  complexity.  The  doubly-connected  edge  list  seems  a 
suitable  way  to  store  an  arrangement;  with  this  representation  we  can  efficiently 
list  the  edges  of  a  given  face,  step  from  one  face  to  a  neighboring  one,  and  so 
on.  A  doubly-connected  edge  list,  however,  can  only  store  bounded  edges,  and 

_  an  arrangement  also  has  a  number  of  unbounded  edges.  Therefore  we  place  a 

180  large  bounding  box  that  encloses  the  interesting  part  of  the  arrangement,  that  is. 


a  bounding  box  that  contains  all  vertices  of  the  arrangement  in  its  interior.  The 
subdivision  defined  by  the  bounding  box  plus  the  part  of  the  arrangement  inside 
it  has  bounded  edges  only  and  can  be  stored  in  a  doubly-connected  edge  list. 

How  can  we  construct  this  doubly-connected  edge  list?  The  approach  that 
immediately  comes  to  mind  is  plane  sweep.  In  Chapter  2  all  intersection  points 
of  a  set  of  line  segments  were  computed  by  plane  sweep,  and  this  algorithm  was 
subsequently  used  to  compute  the  doubly-connected  edge  list  for  the  overlay  of 
two  planar  subdivisions.  Indeed,  it’s  not  so  difficult  to  adapt  the  algorithms  of 
Chapter  2  to  compute  the  arrangement  A(L).  Since  the  number  of  intersection 
points  is  quadratic,  the  algorithm  would  run  in  0{n 2  log  n)  time.  Not  bad,  but  not 
optimal.  So  let’s  try  another  approach  that  may  come  to  mind:  an  incremental 
construction  algorithm. 

A  bounding  box  'B (L)  that  contains  all  vertices  of  A(L)  in  its  interior  can 
easily  be  computed  in  quadratic  time:  compute  all  intersection  points  of  pairs 
of  lines,  and  choose  the  leftmost  one,  the  rightmost  one,  the  bottom  one,  and 
the  top  one.  An  axis-parallel  rectangle  that  contains  these  four  points  contains 
all  vertices  of  the  arrangement. 
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The  incremental  algorithm  adds  the  lines  £\Ai,  ■■■An  one  after  the  other  and 
updates  the  doubly-connected  edge  list  after  each  addition.  Let  A\  denote  the 
subdivision  of  the  plane  induced  by  the  bounding  box  '13  (L)  and  the  part  of 
A({£ i, . . .  ,£i})  inside  B(L).  To  add  the  line  we  must  split  the  faces  in  i 
that  are  intersected  by  f,-.  We  can  find  these  faces  by  walking  along  £;  from  left 
to  right,  as  follows.  Suppose  we  enter  a  face  /  through  an  edge  e.  We  walk 


Figure  8.4 

Traversing  the  arrangement 


along  the  boundary  of  /  following  Atetf  ()-pointers  in  the  doubly-connected 
edge  list  until  we  find  the  half-edge  of  the  edge  e '  where  /:,■  leaves  /.  We  then 
step  to  the  next  face  using  the  TwinQ  -pointer  of  that  half-edge  to  reach  the  other 
half-edge  for  e1  in  the  doubly-connected  edge  list.  This  way  we  find  the  next 
face  in  time  proportional  to  the  complexity  of  /.  It  can  also  happen  that  we  leave 
/  through  a  vertex  v.  In  this  case  we  walk  around  v,  visiting  its  incident  edges, 
until  we  find  the  next  face  intersected  by  The  doubly-connected  edge  list 


181 


Chapter  8  allows  us  to  do  this  in  time  proportional  to  the  degree  of  v.  Figure  8.4  illustrates 
arrangements  and  duality  how  we  traverse  the  arrangement. 

Two  things  are  left:  how  do  we  find  the  leftmost  edge  intersected  by  /■ — this 
is  the  edge  where  we  start  the  walk  through  A.,_i — and  how  do  we  actually  split 
the  faces  we  encounter? 

The  first  issue  is  easy.  The  leftmost  intersection  point  of  and  A\.  i  is  an 
edge  on  23  (L).  We  simply  test  all  of  them  to  locate  the  one  where  the  traversal 
can  be  started.  The  face  incident  to  this  edge  and  inside  23  (L)  is  the  first  face 
that  is  split  by  In  case  lj  intersects  A,_  i  first  in  a  corner  of  23  (L),  the  first 
face  split  by  /■  is  the  unique  face  incident  to  this  corner  and  inside  13  (L).  If  /■  is 
a  vertical  line  we  can  locate  the  bottom  intersection  point  of  £,  and  A,_i  to  start 
off  the  traversal.  Since  A,_i  contains  at  most  2/ +  2  edges  on  13  (L),  the  time 
needed  for  this  step  is  linear  for  each  line. 

Suppose  we  have  to  split  a  face  /,  and  assume  that  the  face  intersected  by 
ii  to  the  left  of  /  has  already  been  split.  In  particular,  we  assume  that  the  edge 
e  where  we  enter  /  has  already  been  split.  Splitting  /  is  done  as  follows — see 
Figure  8.5.  First  of  all  we  create  two  new  face  records,  one  for  the  part  of  / 


Figure  8.5 
Splitting  a  face 


above  /  and  one  for  the  part  of  /  below  £,.  Next  we  split  e' .  the  edge  where 
/■  leaves  /,  and  create  a  new  vertex  for  (,  C\e' .  Thus  we  create  one  new  vertex 
record,  and  two  new  half-edge  records  for  both  new  edges.  (If  /:,■  leaves  / 
through  a  vertex,  then  this  step  is  omitted.)  Furthermore,  we  create  half-edge 
records  for  the  edge  fl/.  It  remains  to  correctly  initialize  the  various  pointers 
in  the  new  face,  vertex,  and  half-edge  records,  set  some  existing  pointers  to  the 
new  vertex  record,  half-edge  records,  and  face  records,  and  destroy  the  face 
record  for  /  and  the  half-edge  records  for  e' .  This  is  done  in  the  same  way  as  in 
Section  2.3,  where  the  overlay  of  two  subdivisions  was  constructed.  The  total 
time  for  the  split  is  linear  in  the  complexity  of  /. 

The  algorithm  for  constructing  an  arrangement  can  be  summarized  as  follows: 

Algorithm  ConstructArrangement(L) 

Input.  A  set  L  of  n  lines  in  the  plane. 

Output.  The  doubly-connected  edge  list  for  the  subdivision  induced  by  23  (L) 
and  the  part  of  A(L)  inside  23  (L),  where  23  (L)  is  a  bounding  box  containing 
all  vertices  of  A(L)  in  its  interior. 
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1.  Compute  a  bounding  box  3{L)  that  contains  all  vertices  of  A(L)  in  its  Section  8.3 

interior.  arrangements  of  lines 

2.  Construct  the  doubly-connected  edge  list  for  the  subdivision  induced  by 

23(L). 

3.  for  i  < —  1  to  n 

4.  do  Find  the  edge  e  on  3  (L)  that  contains  the  leftmost  intersection  point 

of  and  A,. 

5.  f  <—  the  bounded  face  incident  to  e 

6.  while  /  is  not  the  unbounded  face,  that  is,  the  face  outside  3 (L) 

7.  do  Split  /,  and  set  /  to  be  the  next  intersected  face. 

We  have  given  a  simple  incremental  algorithm  for  computing  an  arrangement. 

Next  we  analyze  its  running  time.  Step  1  of  the  algorithm,  computing  3 (L), 
can  be  done  in  0(n2)  time.  Step  2  takes  only  constant  time.  Finding  the  first 
face  split  by  /:,■  takes  O(n)  time,  as  we  noted  before.  We  now  bound  the  time  it 
takes  to  split  the  faces  intersected  by  f,-. 

First,  assume  that  A(L)  is  simple.  In  this  case  the  time  we  spend  to  split 
a  face  /  and  to  find  the  next  intersected  face  is  linear  in  the  complexity  of  /. 

Hence,  the  total  time  we  need  to  insert  line  f,  is  linear  in  the  sum  of  the 
complexities  of  the  faces  of  71,-1  intersected  by  f,-.  When  A(L)  is  not  simple, 
we  may  leave  /  through  a  vertex  v.  In  that  case  we  have  to  walk  around  v  to 
find  the  next  face  to  split,  and  we  encounter  edges  that  are  not  on  the  boundary 
of  an  intersected  face.  But  notice  that  the  edges  we  encounter  in  this  case  are  on 
the  boundary  of  faces  whose  closure  is  intersected  by  £,  .  This  leads  us  to  the 
concept  of  zones. 


The  zone  of  a  line  t  in  the  arrangement  A(L)  induced  by  a  set  L  of  lines  in  the 


Figure  8.6 

The  zone  of  a  line  in  an  arrangement  of 
lines 


plane  is  the  set  of  faces  of  A(L)  whose  closure  intersects  £.  Figure  8.6  gives  an 
example  of  a  zone  consisting  of  nine  faces.  The  complexity  of  a  zone  is  defined 
as  the  total  complexity  of  all  faces  it  consists  of,  that  is,  the  sum  of  the  number 
of  edges  and  vertices  of  these  faces.  In  Figure  8.6  you  can  see  that  some  vertices 
are  counted  once  in  the  zone  complexity,  others  are  counted  twice,  three  times, 
or  even  four  times.  The  time  we  need  to  insert  line  f,-  is  linear  in  the  complexity 

of  the  zone  of  i in  A  ( {f  i , . . . ,  l ,■ } ) .  The  Zone  Theorem  tells  us  that  this  quantity  _ 

is  linear:  183 


Chapter  8  Theorem  8.5  (Zone  Theorem)  The  complexity  of  the  zone  of  a  line  in  an 
arrangements  and  duality  arrangement  ofm  lines  in  the  plane  is  0{m). 
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Proof.  Let  L  be  a  set  of  m  lines  in  the  plane,  and  let  £  be  another  line.  Without 
loss  of  generality  we  assume  that  l  coincides  with  the  x-axis;  we  can  change 
the  coordinate  system  for  this.  We  assume  that  no  line  of  L  is  horizontal.  This 
assumption  is  removed  at  the  end  of  the  proof. 

Each  edge  in  7l(L)  bounds  two  faces.  We  say  that  an  edge  is  a  left  bounding 
edge  for  the  face  lying  to  the  right  of  it  and  a  right  bounding  edge  for  the  face 
lying  to  the  left  of  it.  We  shall  prove  that  the  number  of  left  bounding  edges 
of  the  faces  in  the  zone  of  t  is  at  most  5m.  By  symmetry,  the  number  of  right 
bounding  edges  is  then  bounded  by  5m  as  well,  and  the  theorem  follows. 

The  proof  is  by  induction  on  m.  The  base  case,  m  =  1,  is  trivially  true.  Now 
let  m  >  1.  Of  the  lines  in  L,  let  £\  be  the  one  that  has  the  rightmost  intersection 
with  i.  We  first  assume  that  this  line  is  uniquely  defined.  By  induction,  the  zone 
of  i  in  j4(L\  {fi})  has  at  most  5 (in  —  1)  left  bounding  edges.  When  we  add  the 
line  1 1 ,  the  number  of  left  bounding  edges  increases  in  two  ways:  there  are  new 
left  bounding  edges  on  £\  and  there  are  old  left  bounding  edges  that  are  split  by 
i  i .  Let  v  be  the  first  intersection  point  of  f\  with  another  line  in  L  above  £,  and 
let  vv  be  the  first  intersection  point  of  £\  below  £.  The  edge  connecting  v  and  w 
is  a  new  left  bounding  edge  on  l  \ .  Furthermore,  i\  splits  a  left  bounding  edge 
at  the  points  v  and  w.  This  adds  up  to  an  increase  of  three  in  the  number  of  left 
bounding  edges.  If  v  or  w  doesn’t  exist,  the  increase  is  even  less.  We  claim  that 
this  is  the  only  increase. 

Consider  the  part  of  i\  above  v.  Let  1 2  be  a  line  that  intersects  £  \  at  v.  The 
region  above  v  enclosed  by  £\  and  £2  is  not  in  the  zone  of  l.  Because  £ 2  crosses 
i\  from  left  to  right  at  v,  the  region  lies  to  the  right  of  l\ .  Hence,  the  part  of  t\ 
above  v  cannot  contribute  any  left  bounding  edges  to  the  zone.  Moreover,  if  a 
left  bounding  edge  e  that  was  in  the  zone  is  intersected  by  l\  somewhere  above 
v,  then  the  part  of  e  to  the  right  of  £\  is  no  longer  in  the  zone.  Hence,  there  is  no 
increase  in  the  number  of  left  bounding  edges  due  to  such  an  intersection. 

In  the  same  way  it  can  be  shown  that  the  part  of  t \  below  w  does  not  increase 
the  number  of  left  bounding  edges  in  the  zone  of  £.  Therefore  the  total  increase 
is  at  most  three,  as  claimed.  The  total  number  of  left  bounding  edges  in  this 
case  is  therefore  at  most  5  (in  —  1 )  +  3  <  5m. 

Up  to  now  we  assumed  that  the  line  f  \  through  the  rightmost  intersection 
point  on  £  is  unique.  If  there  is  more  than  one  line  passing  through  the  rightmost 
intersection  point,  then  we  take  an  arbitrary  one  to  be  f\ .  Following  the  same 
arguments  as  above,  the  increase  in  the  number  of  left  bounding  edges  can  be 
shown  to  be  at  most  five.  (If  more  than  two  lines  pass  through  the  intersection 
point,  the  increase  is  at  most  four.  If  exactly  two  lines  pass  through  it,  the 
increase  is  at  most  five.)  Hence,  the  total  number  of  left  bounding  edges  is  at 
most  5  (m  —  1 )  +  5  =  5m. 

Finally,  we  remove  the  assumption  that  no  line  of  L  is  horizontal.  For  a 
horizontal  line  that  doesn’t  coincide  with  £,  a  slight  rotation  only  increases  the 
complexity  of  the  zone  of  £  in  A(L).  Since  we  are  proving  an  upper  bound  on 
the  zone  complexity,  we  can  safely  assume  such  lines  don’t  exist.  If  L  contains 


a  line  £,  that  coincides  with  £,  then  the  proof  above  shows  that  the  zone  of  £  in 
A(L\  {£,-})  has  at  most  10 m—  10  edges,  and  the  addition  of  /.,■  increases  this 
quantity  by  at  most  4 m  —  2:  at  most  m  edges  on  £,  for  the  faces  above  f,-,  at  most 
m  edges  on  £j  for  the  faces  below  £t,  and  at  most  m  —  1  edges  are  split  into  two, 
each  of  which  is  counted  as  a  left  bounding  edge  and  a  right  bounding  edge. 
This  concludes  the  proof  of  the  Zone  Theorem.  0 

We  can  now  bound  the  running  time  of  the  incremental  algorithm  for  con¬ 
structing  an  arrangement.  We  have  seen  that  the  time  needed  to  insert  £,  is  linear 
in  the  complexity  of  the  zone  of  £j  in  A({£\ , .  i }).  By  the  Zone  Theorem 
this  is  <9(i),  so  the  time  required  to  insert  all  lines  is 

1 0(0  =  0(n2). 

i=i 

Steps  1-2  of  the  algorithm  together  take  <9(h2)  time,  so  the  total  running  time 
of  the  algorithm  is  <9(h2).  Because  the  complexity  of  A(L)  is  0(h2)  when  A(L) 
is  simple,  our  algorithm  is  optimal. 

Theorem  8.6  A  doubly-connected  edge  list  for  the  arrangement  induced  by  a 
set  of  n  lines  in  the  plane  can  be  constructed  in  0{n2)  time. 


8.4  Levels  and  Discrepancy 

It’s  time  to  go  back  to  the  discrepancy  problem.  We  had  dualized  the  set  S  of  n 
sample  points  into  a  set  S*  of  n  lines,  and  we  needed  to  compute  for  every  vertex 
of  A  (S*)  how  many  lines  lie  above  it,  pass  through  it,  and  lie  below  it.  For  each 
vertex,  these  three  numbers  add  up  to  exactly  n,  so  it  is  sufficient  to  compute 
two  of  the  numbers.  After  we  have  constructed  a  doubly-connected  edge  list  for 
A  (S*)  we  know  how  many  lines  pass  through  each  vertex.  We  define  the  level 
of  a  point  in  an  arrangement  of  lines  to  be  the  number  of  lines  strictly  above  it. 
We  next  show  how  to  compute  the  level  of  each  vertex  in  A  (5*). 

To  compute  the  levels  of  the  vertices  of  A. (5*)  we  do  the  following  for  each 
line  £  £  S*.  First,  we  compute  the  level  of  the  leftmost  vertex  on  £  in  (9(h)  time, 
by  checking  for  each  of  the  remaining  lines  whether  it  lies  strictly  above  that 
vertex.  Next  we  walk  along  £  from  left  to  right  to  visit  the  other  vertices  on  £, 
using  the  doubly-connected  edge  list.  It  is  easy  to  maintain  the  level  while  we 
walk:  the  level  only  changes  at  a  vertex,  and  the  change  can  be  computed  by 
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in  an  arrangement 


Figure  8. 7 

Maintaining  the  level  while  walking 
along  a  line 
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Chapter  8  inspecting  the  edges  incident  to  the  vertex  that  is  encountered.  In  Figure  8.7,  for 

arrangements  and  duality  instance,  the  leftmost  vertex  on  i  has  level  one.  The  points  on  the  edge  incident 

to  that  vertex  and  going  to  the  right  also  have  level  one.  At  the  second  vertex 
a  line  crosses  i  coming  from  above;  the  level  decreases  by  one  and  becomes 
zero.  Since  the  level  is  defined  as  the  number  of  lines  strictly  above  a  point,  the 
level  of  the  second  vertex  itself  is  also  zero.  At  the  third  vertex  a  line  crosses  £ 
coming  from  below.  Hence,  the  level  increases  to  one  after  the  vertex  is  passed; 
the  vertex  itself  still  has  level  zero.  And  so  on.  Note  that  we  needn’t  worry 
about  vertical  lines;  our  set  is  obtained  by  dualizing  a  set  of  points.  This  way 
the  time  to  compute  the  levels  of  the  vertices  on  t  is  0(n).  Hence,  the  levels  of 
all  vertices  of  A(5*)  can  be  computed  in  0{n2)  time. 

The  number  of  lines  above,  through,  and  below  each  vertex  of  A (S*)  gives 
us  all  the  information  we  need  to  compute  the  discrete  measure  of  the  half-planes 
bounded  by  lines  containing  two  points  in  S.  Hence,  these  discrete  measures 
can  be  computed  in  0(n2)  time.  This  finally  finishes  the  proof  of  Theorem  8.2. 


8.5  Notes  and  Comments 
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In  this  chapter  some  important  non-algorithmic  concepts  were  introduced:  geo¬ 
metric  duality  and  arrangements.  Duality  is  a  transform  that  can  shed  a  different 
light  on  a  geometric  problem,  and  is  a  standard  tool  for  the  computational 
geometer.  The  duality  transform  of  Section  8.2  is  not  defined  for  vertical  lines. 
Usually,  vertical  lines  can  be  treated  as  a  special  case  or  by  a  perturbation  of  the 
setting.  There  exist  different  duality  transforms  that  can  handle  vertical  lines, 
but  these  have  other  drawbacks — see  Edelsbrunner’s  book  [158].  Duality  ap¬ 
plies  to  higher-dimensional  point  sets  as  well.  For  a  point  p  =  (p\,p2  -  ■  ■  ■  ,p,i), 

its  dual  p*  is  the  hyperplane  xj  =  p\X\  +  P2X2  H - V  pd-\Xd-\  —  Pd-  For 

a  hyperplane  Xd  =  a\x\  +  CI2X2  +  •  •  •  +  ad~\Xd~\  +  ad ,  its  dual  is  the  point 
(a\ ,Q2,  ■  ■ . ,  Ud  - 1  ■  —ad)-  The  transform  is  incidence  and  order  preserving. 

Recall  that  with  the  geometric  interpretation  of  the  duality  transform  using 
the  parabola  y  =  x2 / 2,  the  dual  of  any  point  can  be  constructed.  Interestingly,  the 
dual  of  a  point  q  can  also  be  constructed  without  measuring  distances.  Assume 
that  q  lies  below  11.  Draw  the  two  lines  that  pass  through  q  and  are  tangent  to  If. 
The  line  q*  is  the  line  through  the  two  points  where  these  tangents  touch  If.  By 
construction,  point  q  is  the  intersection  of  the  two  tangent  lines.  Hence,  the  dual 
of  q  must  go  through  the  duals  of  these  two  tangents,  which  are  the  points  where 
the  tangents  touch  'll.  The  dual  of  a  point  above  If  can  be  constructed  without 
measuring  distances  as  well.  We  won’t  show  how  to  do  this  here.  (One  hint: 
you  will  need  to  be  able  to  draw  the  line  through  a  given  point  that  is  parallel  to 
a  given  line.) 

Another  geometric  transform  that  has  been  applied  successfully  in  computa¬ 
tional  geometry  is  inversion.  It  can  change  the  point-inside-circle  relation  in  the 
plane  to  a  point-below-plane  relation  in  3-dimensional  space.  More  specifically, 
a  point  p  :=  (px.py)  is  lifted  to  the  unit  paraboloid  z  =  x2  +y2  in  3-dimensional 


space,  so 
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P°  ■=  (. Px,Py,Px  +  Py )• 

A  circle  C  :=  (x  — a)2  +  (y  —  b)2  =  r2  in  the  plane  is  transformed  to  a  plane  in 
3 -dimensional  space  by  lifting  the  circle  to  the  unit  paraboloid  and  taking  the 
plane  through  the  lifted  circle.  In  particular, 

CP  :=  (z  =  a(x  —  a)  +  b(y  —  b)  +  r2) . 

Now  p  lies  inside  C  if  and  only  if  p'  is  below  CP.  This  transform  can  be 
extended  to  higher  dimensions,  where  a  hypersphere  in  7-dimensional  space 
becomes  a  hyperplane  in  (d  +  1) -dimensional  space. 

Arrangements  have  been  studied  extensively  in  computational  and  combinato¬ 
rial  geometry.  Arrangements  are  not  restricted  to  the  plane.  A  3-dimensional 
arrangement  is  induced  by  a  set  of  planes,  and  a  higher-dimensional  arrange¬ 
ment  is  induced  by  a  set  of  hyperplanes.  The  book  of  Edelsbrunner  [158]  is  an 
excellent  exposition  of  the  research  on  arrangements  up  to  1987.  It  also  contains 
the  references  to  earlier  textbooks  on  combinatorial — but  not  computational — 
geometry.  For  a  more  recent  survey  see  the  handbook  chapter  by  Halperin  [206]. 
We  list  a  selection  of  results  on  arrangements  in  the  plane  and  in  higher  dimen¬ 
sions. 

The  complexity  of  an  arrangement  of  n  hyperplanes  in  7-dimensional  space 
is  Q(nd)  in  the  worst  case.  Any  simple  arrangement — one  where  any  d  hyper¬ 
planes  but  no  d  +  1  hyperplanes  intersect  in  one  point — achieves  this  bound. 
Edelsbrunner  et  al.  [165]  presented  the  first  optimal  algorithm  for  constructing 
arrangements.  The  optimality  of  this  incremental  construction  algorithm  de¬ 
pends  on  a  higher-dimensional  version  of  the  Zone  Theorem,  which  states  that 
the  zone  of  a  hyperplane  in  an  arrangement  of  n  hyperplanes  in  7-dimensional 
space  has  complexity  0(nd~ 1 ) .  A  proof  of  this  theorem  is  given  by  Edelsbrunner 
etal.  [168]. 

The  concept  of  levels  in  arrangements  extends  to  higher  dimensions  as 
well — see  Edelsbrunner’ s  book  [158].  The  /.-level  in  an  arrangement  .A (77) 
induced  by  a  set  H  of  n  hyperplanes  is  defined  as  the  set  of  points  with  at  most 
k—  1  hyperplanes  strictly  above  it,  and  at  most  n  —  k  hyperplanes  strictly  below. 
Tight  bounds  on  the  maximum  complexity  of  7-levels  are  still  unknown,  even 
in  the  planar  case.  In  the  dual  setting,  the  problem  is  closely  related  to  the 
following  question:  given  an  set  of  n  points,  how  many  subsets  of  k  points 
can  be  separated  from  the  other  n  —  k  points  by  a  hyperplane?  Such  subsets 
are  called  k-sets ,  and  the  maximum  number  of  7-sets  in  a  set  of  n  points  is 
again  unknown.  For  the  planar  case — both  for  7-sets  and  7-levels — Erdos  et 
al.  [174]  in  1973  proved  a  lower  bound  of  £2(«log(7  +  1))  and  an  upper  bound 
of  0(ns/k).  This  was  were  the  problem  stood  for  a  long  time  (except  for  a  slight 
improvement  of  the  upper  bound  by  Pach  et  al.  [313]  to  0(«v/7/log*(7  +  1))) 
until  in  1997  Dey  [143,  144]  managed  to  prove  an  0(nk */3)  upper  bound,  which 
is  now  the  best  known  bound. 

Given  a  set  of  n  points  in  the  plane,  how  many  subsets  of  at  most  7  points  can 
be  separated  from  the  other  n—k  points  by  a  line?  Such  subsets  are  called  7)- 
sets.  Unlike  for  7-sets,  tight  bounds  on  the  maximum  number  of  ('7  7)-sets 
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Chapter  8  are  known.  In  the  plane  the  maximum  number  is  &(nk),  and  in  ^/-dimensional 
arrangements  and  duality  space  it  is  &(n ld/2i  ),  which  was  shown  by  Clarkson  and  Shor  [133].  The 

same  bounds  hold  for  (V  Zr)-levels  in  arrangements. 

In  the  notes  and  comments  of  Chapter  7,  a  connection  between  Voronoi 
diagrams  and  convex  polyhedra  in  one  dimension  higher  was  explained:  the 
Voronoi  diagram  of  a  set  of  points  in  the  plane  is  the  same  as  the  projection  of  the 
boundary  of  the  common  intersection  of  a  set  of  half-spaces  in  3-dimensional 
space.  This  boundary  is  in  fact  the  0-level  of  the  arrangement  of  planes  bounding 
these  half-spaces.  This  connection  extends  to  order-/:  Voronoi  diagrams  and 
Zi-levels  in  arrangements:  the  k-level  in  the  same  arrangement  of  planes  projects 
down  to  the  order-/:  Voronoi  diagram  of  the  points. 

Arrangements  can  be  defined  for  objects  other  than  lines  and  hyperplanes. 
A  set  of  line  segments  in  the  plane,  for  instance,  also  forms  an  arrangement. 
For  such  arrangements,  even  bounds  on  the  maximum  complexity  of  a  single 
face  aren’t  simple  to  prove.  Since  faces  can  be  non-convex,  line  segments  can 
appear  several  times  on  the  boundary.  Indeed,  the  maximum  complexity  of  a 
single  face  can  be  superlinear:  it  is  &(na(n))  in  the  worst  case,  where  a(/i)  is 
the  extremely  slowly  growing  functional  inverse  of  Ackermann’s  function.  The 
upper  bound  can  be  proved  using  Davenport-Schinzel  sequences;  the  interested 
reader  is  referred  to  the  book  by  Sharir  and  Agarwal  [353]. 

The  main  motivation  for  studying  combinatorial  structures  like  arrange¬ 
ments,  single  cells  in  arrangements,  and  envelopes,  lies  in  motion  planning. 
Several  motion  planning  problems  can  be  formulated  as  problems  on  arrange¬ 
ments  and  their  substructures  [201,  207,  208,  231,  342,  343]. 

Our  original  motivation  for  studying  arrangements  arose  from  computer  graphics 
and  the  quality  of  random  samples.  The  use  of  discrepancy  was  introduced  to 
computer  graphics  by  Shirley  [358],  and  developed  algorithmically  by  Dobkin 
and  Mitchell  [150],  Dobkin  and  Eppstein  [149],  Chazelle  [96],  and  de  Berg  [50]. 


8.6  Exercises 

8.1  Prove  that  the  duality  transform  introduced  in  this  chapter  is  indeed 
incidence  and  order  preserving,  as  claimed  in  Observation  8.3. 

8.2  The  dual  of  a  line  segment  is  a  left-right  double  wedge,  as  was  shown  in 
Section  8.2. 

a.  What  is  the  dual  of  the  collection  of  points  inside  a  given  triangle  with 
vertices  p,  q ,  and  r? 

b.  What  type  of  object  in  the  primal  plane  would  dualize  to  a  top-bottom 
double  wedge? 

8.3  Use  Euler’s  formula  to  show  that  the  maximum  number  of  faces  is  n2  / 2  + 
n/ 2+1  for  an  arrangement  with  n(n  —  l)/2  vertices  and  n2  edges. 
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8.4  Let  L  be  a  set  of  n  lines  in  the  plane.  Give  an  0(ii  log/;)  time  algorithm  to  Section  8.6 
compute  an  axis-parallel  rectangle  that  contains  all  the  vertices  of  A(L')  exercises 
in  its  interior. 

8.5  Let  S  be  a  set  of  n  points  in  the  plane.  In  this  chapter  an  algorithm  was 
given  to  determine  for  every  line  l  through  two  points  of  S  how  many 
points  of  S  lie  strictly  above  L  This  was  done  by  dualizing  the  problem 
first.  Transform  the  algorithm  for  the  dual  problem  back  to  the  primal 
plane,  and  give  the  corresponding  0(n2)  time  algorithm  for  the  given 
problem.  (This  exercise  should  help  you  to  appreciate  duality.) 

8.6  Let  S  be  a  set  of  n  points  in  the  plane  and  let  L  be  a  set  of  m  lines  in  the 
plane.  Suppose  we  wish  to  determine  whether  there  is  a  point  in  S  that 
lies  on  a  line  in  L.  What  is  the  dual  of  this  problem? 

8.7  Let  R  be  a  set  of  n  red  points  in  the  plane,  and  let  B  be  a  set  of  n  blue 
points  in  the  plane.  We  call  a  line  l  a  separator  for  R  and  B  if  f  has 
all  points  of  R  to  one  side  and  all  points  of  B  to  the  other  side.  Give  a 
randomized  algorithm  that  can  decide  in  0(n )  expected  time  whether  R 
and  B  have  a  separator. 

8.8  The  dual  transform  of  Section  8.2  has  minus  signs.  Suppose  we  change 
them  to  plus  signs,  so  the  dual  of  a  point  (px,Py)  is  the  line  y  =  pxx  +  py, 
and  the  dual  of  the  line  y  =  mx  +  b  is  the  point  ( m,b ).  Is  this  dual 
transform  incidence  and  order  preserving? 

8.9  Let  P  be  a  set  of  n  points  in  the  plane.  Let  p  £  P  be  one  of  these  points. 

Give  a  randomized  algorithm  that  can  decide  in  0(n )  expected  time 
whether  p  is  a  vertex  of  the  convex  hull  of  P. 

8.10  Let  L  be  a  set  of  n  non-vertical  lines  in  the  plane.  Suppose  the  arrange¬ 
ment  A  (L)  only  has  vertices  with  level  0.  What  can  you  say  about  this 
arrangement?  Next  suppose  that  lines  of  L  can  be  vertical.  What  can  you 
say  now  about  the  arrangement? 

8.11  Let  L  be  a  set  of  lines  in  the  plane,  and  let  /  be  the  face  of  A  (L)  containing 
the  origin.  Describe  the  set  of  lines  that  are  the  duals  of  the  points  in  /. 

Also  describe  the  dual  of  a  vertex  of  /;  distinguish  between  a  vertex  that 
is  the  intersection  of  two  lines  that  both  pass  above  the  origin,  a  vertex 
that  is  the  intersection  of  two  lines  that  both  pass  below  the  origin,  and  a 
vertex  that  is  the  intersection  of  one  line  passing  above  the  origin  and  one 
line  passing  below  it. 

8.12  While  constructing  the  arrangement  of  a  set  L  of  lines,  we  traversed 
every  line  of  L  from  left  to  right  when  we  added  it.  When  computing 
the  discrepancy,  we  needed  the  level  of  each  vertex  in  the  arrangement. 

To  determine  these  levels,  we  traversed  every  line  of  L  from  left  to  right 
again.  Is  it  possible  to  combine  these  two  traversals,  that  is,  can  you  add 
the  lines  to  the  arrangement  and  compute  the  levels  of  the  intersection 
points  immediately? 
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8.13  Given  a  set  L  of  n  lines  in  the  plane,  give  an  O(n\ogn)  time  algorithm  to 
compute  the  maximum  level  of  any  vertex  in  the  arrangement  A(L). 


8.14  Let  S  be  a  set  of  n  points  in  the  plane.  Give  an  0(n2)  time  algorithm  to 
find  the  line  containing  the  maximum  number  of  points  in  S. 


8.15  Let  S  be  a  set  of  n  segments  in  the  plane.  We  want  to  preprocess  S  into  a 
data  structure  that  can  answer  the  following  query:  Given  a  query  line  l, 
how  many  segments  in  5  does  it  intersect? 

a.  Formulate  the  problem  in  the  dual  plane. 

b.  Describe  a  data  structure  for  this  problem  that  uses  0(n 2)  expected 
storage  and  has  O(\ogn)  expected  query  time. 

c.  Describe  how  the  data  structure  can  be  built  in  0(n2  logn)  expected 
time. 


8.16  Let  S  be  a  set  of  n  segments  in  the  plane.  A  line  £  that  intersects  all 
segments  of  S  is  called  a  transversal  or  stabber  for  S. 

a.  Give  an  0{n 2)  algorithm  to  decide  if  a  stabber  exists  for  S. 

b.  Now  assume  that  all  segments  are  vertical.  Give  a  randomized  algo¬ 
rithm  with  0(n)  expected  running  time  that  decides  if  a  stabber  exists 
for  S. 
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9  Delaunay  Triangulations 

Height  Interpolation 


When  we  talked  about  maps  of  a  piece  of  the  earth’s  surface  in  previous  chapters, 
we  implicitly  assumed  there  is  no  relief.  This  may  be  reasonable  for  a  country 
like  the  Netherlands,  but  it  is  a  bad  assumption  for  Switzerland.  In  this  chapter 
we  set  out  to  remedy  this  situation. 

We  can  model  a  piece  of  the  earth’s  surface  as  a  terrain.  A  terrain  is  a 
2-dimensional  surface  in  3-dimensional  space  with  a  special  property:  every 
vertical  line  intersects  it  in  a  point,  if  it  intersects  it  at  all.  In  other  words,  it 
is  the  graph  of  a  function  /  :AcR2-*R  that  assigns  a  height  f(p)  to  every 
point  p  in  the  domain ,  A,  of  the  terrain.  (The  earth  is  round,  so  on  a  global 
scale  terrains  defined  in  this  manner  are  not  a  good  model  of  the  earth.  But 
on  a  more  local  scale  terrains  provide  a  fairly  good  model.)  A  terrain  can  be 
visualized  with  a  perspective  drawing  like  the  one  in  Figure  9.1,  or  with  contour 
lines — lines  of  equal  height — like  on  a  topographic  map. 


Figure  9.1 

A  perspective  view  of  a  terrain 


Of  course,  we  don’t  know  the  height  of  every  point  on  earth;  we  only  know  it 
where  we’ve  measured  it.  This  means  that  when  we  talk  about  some  terrain,  we 
only  know  the  value  of  the  function  /  at  a  finite  set  P  C  A  of  sample  points.  From 
the  height  of  the  sample  points  we  somehow  have  to  approximate  the  height 
at  the  other  points  in  the  domain.  A  naive  approach  assigns  to  every  p  £  A  the 
height  of  the  nearest  sample  point.  However,  this  gives  a  discrete  terrain,  which 
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doesn’t  look  very  natural.  Therefore  our  approach  for  approximating  a  terrain 
is  as  follows.  We  first  determine  a  triangulation  of  P:  a  planar  subdivision 
whose  bounded  faces  are  ttiangles  and  whose  vertices  are  the  points  of  P.  (We 
assume  that  the  sample  points  are  such  that  we  can  make  the  Uiangles  cover 
the  domain  of  the  terrain.  )  We  then  lift  each  sample  point  to  its  correct  height, 
thereby  mapping  every  triangle  in  the  triangulation  to  a  Uiangle  in  3-space. 
Figure  9.2  illustrates  this.  What  we  get  is  a  polyhedral  terrain ,  the  graph  of  a 
continuous  function  that  is  piecewise  linear.  We  can  use  the  polyhedral  terrain 
as  an  approximation  of  the  original  terrain. 


Figure  9.2 

Obtaining  a  polyhedral  terrain  from  a 
set  of  sample  points 


The  question  remains:  how  do  we  triangulate  the  set  of  sample  points?  In 
general,  this  can  be  done  in  many  different  ways.  But  which  triangulation  is  the 
most  appropriate  one  for  our  purpose,  namely  to  approximate  a  terrain?  There 
is  no  definitive  answer  to  this  question.  We  do  not  know  the  original  terrain,  we 
only  know  its  height  at  the  sample  points.  Since  we  have  no  other  information, 
and  the  height  at  the  sample  points  is  the  correct  height  for  any  triangulation,  all 
triangulations  of  P  seem  equally  good.  Nevertheless,  some  triangulations  look 
more  natural  than  others.  For  example,  have  a  look  at  Figure  9.3,  which  shows 
two  triangulations  of  the  same  point  set.  From  the  heights  of  the  sample  points 
we  get  the  impression  that  the  sample  points  were  taken  from  a  mountain  ridge. 
Triangulation  (a)  reflects  this  intuition.  Triangulation  (b),  however,  where  one 
single  edge  has  been  “flipped,”  has  introduced  a  narrow  valley  cutting  through 
the  mountain  ridge.  Intuitively,  this  looks  wrong.  Can  we  turn  this  intuition  into 
a  criterion  that  tells  us  that  triangulation  (a)  is  better  than  triangulation  (b)? 


Figure  9.3 

Flipping  one  edge  can  make  a  big 
difference 
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The  problem  with  triangulation  (b)  is  that  the  height  of  the  point  q  is  deter- 


mined  by  two  points  that  are  relatively  far  away.  This  happens  because  q  lies  in  Section  9.1 

the  middle  of  an  edge  of  two  long  and  sharp  triangles.  The  skinniness  of  these  triangulations  of  planar  point 

triangles  causes  the  trouble.  So  it  seems  that  a  triangulation  that  contains  small  sets 

angles  is  bad.  Therefore  we  will  rank  triangulations  by  comparing  their  smallest 

angle.  If  the  minimum  angles  of  two  triangulations  are  identical,  then  we  can 

look  at  the  second  smallest  angle,  and  so  on.  Since  there  is  only  a  finite  number 

of  different  triangulations  of  a  given  point  set  P,  this  implies  that  there  must  be 

an  optimal  triangulation,  one  that  maximizes  the  minimum  angle.  This  will  be 

the  triangulation  we  are  looking  for. 


9.1  Triangulations  of  Planar  Point  Sets 


Let  P  :=  {p\,pii  ■  ■  ■  ,Pn}  be  a  set  of  points  in  the  plane.  To  be  able  to  formally 
define  a  triangulation  of  P,  we  first  define  a  maximal  planar  subdivision  as 
a  subdivision  S  such  that  no  edge  connecting  two  vertices  can  be  added  to 
S  without  destroying  its  planarity.  In  other  words,  any  edge  that  is  not  in  S 
intersects  one  of  the  existing  edges.  A  triangulation  of  P  is  now  defined  as  a 
maximal  planar  subdivision  whose  vertex  set  is  P. 

With  this  definition  it  is  obvious  that  a  triangulation  exists.  But  does  it 
consist  of  triangles?  Yes,  every  face  except  the  unbounded  one  must  be  a 
triangle:  a  bounded  face  is  a  polygon,  and  we  have  seen  in  Chapter  3  that  any 
polygon  can  be  triangulated.  What  about  the  unbounded  face?  It  is  not  difficult 
to  see  that  any  segment  connecting  two  consecutive  points  on  the  boundary  of 
the  convex  hull  of  P  is  an  edge  in  any  triangulation  T.  This  implies  that  the 
union  of  the  bounded  faces  of  T  is  always  the  convex  hull  of  P,  and  that  the 
unbounded  face  is  always  the  complement  of  the  convex  hull.  (In  our  application 
this  means  that  if  the  domain  is  a  rectangular  area,  say,  we  have  to  make  sure 
that  the  corners  of  the  domain  are  included  in  the  set  of  sample  points,  so  that 
the  triangles  in  the  triangulation  cover  the  domain  of  the  terrain.  )  The  number 
of  triangles  is  the  same  in  any  triangulation  of  P.  This  also  holds  for  the  number 
of  edges.  The  exact  numbers  depend  on  the  number  of  points  in  P  that  are  on 
the  boundary  of  the  convex  hull  of  P.  (Here  we  also  count  points  in  the  interior 
of  convex  hull  edges.  Hence,  the  number  of  points  on  the  convex  hull  boundary 
is  not  necessarily  the  same  as  the  number  of  convex  hull  vertices.)  This  is  made 
precise  in  the  following  theorem. 

Theorem  9.1  Let  P  be  a  set  of  n  points  in  the  plane,  not  all  collinear,  and  let  k 
denote  the  number  of  points  in  P  that  lie  on  the  boundary  of  the  convex  hull 
of  P.  Then  any  triangulation  of  P  has  2n  2  k  triangles  and  3n  —  3  —  k  edges. 

Proof.  Let  T  be  a  triangulation  of  P,  and  let  m  denote  the  number  of  triangles 
of  T.  Note  that  the  number  of  faces  of  the  triangulation,  which  we  denote  by 
tip ,  is  m+  1.  Every  triangle  has  three  edges,  and  the  unbounded  face  has  k 
edges.  Furthermore,  every  edge  is  incident  to  exactly  two  faces.  Hence,  the 
total  number  of  edges  of  T  is  ne  :=  (3m  +  k) /2.  Euler’s  formula  tells  us  that 


n  —  ne  +  nf  =  2. 
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Chapter  9  Plugging  the  values  for  ne  and  n  f  into  the  formula,  we  get  m  =  2n  —  2  —  k,  which 

delaunay  triangulations  in  turn  implies  ne  =  3n  —  3  —  k.  ED 

Let  “The  a  triangulation  of  P,  and  suppose  it  has  m  triangles.  Consider  the 
3m  angles  of  the  triangles  of  T,  sorted  by  increasing  value.  Let  a,\ ,  a2, . . . ,  cc^m 
be  the  resulting  sequence  of  angles;  hence,  a,  ^  <Xp  for  ;  <  j.  We  call  A(T)  := 
(ai ,  CC2,  .  .  . ,  O';;,,, )  the  angle-vector  of  T.  Let  7'  be  another  triangulation  of  the 
same  point  set  P,  and  let  A  (7')  :=  (a\  .(fl.  ...  ■  0t'^m)  be  its  angle-vector.  We 
say  that  the  angle-vector  of  7  is  larger  than  the  angle-vector  of  7'  if  A(T)  is 
lexicographically  larger  than  A(T'),  or,  in  other  words,  if  there  exists  an  index  i 
with  1  ^  i  ^  3m  such  that 

a;-  =  a'j  for  all  j  <  i,  and  a,  >  aj. 

We  denote  this  as  A(T)  >  A(T').  A  triangulation  7  is  called  angle-optimal  if 
A(T)  ^  A(T')  for  all  triangulations  T  of  P.  Angle-optimal  triangulations  are 
interesting  because,  as  we  have  seen  in  the  introduction  to  this  chapter,  they  are 
good  triangulations  if  we  want  to  construct  a  polyhedral  terrain  from  a  set  of 
sample  points. 


Below  we  will  study  when  a  triangulation  is  angle-optimal.  To  do  this  it  is 
useful  to  know  the  following  theorem,  often  called  Thales’s  Theorem.  Denote 
the  smaller  angle  defined  by  three  points  p,  q ,  r  by  7pqr. 

Theorem  9.2  Let  C  be  a  circle,  £  a  line  intersecting  C  in  points  a  and  b,  and  p, 
q,  r,  and  s  points  lying  on  the  same  side  of  i.  Suppose  that  p  and  q  lie  on  C,  that 
r  lies  inside  C,  and  that  s  lies  outside  C.  Then 

Z arb  >  Z apb  =  Zaqb  >  Z asb . 

Now  consider  an  edge  e  =  "pip]  of  a  triangulation  T  of  P.  If  e  is  not  an  edge 
of  the  unbounded  face,  it  is  incident  to  two  triangles  PiPjPk  and  PiPjPi .  If  these 
two  triangles  form  a  convex  quadrilateral,  we  can  obtain  a  new  triangulation 
7'  by  removing  ~pip]  from  7  and  inserting  pjjTj  instead.  We  call  this  operation 
an  edge  flip.  The  only  difference  in  the  angle-vector  of  T  and  7'  are  the  six 


angles  oq, . . . ,  a(,  in  A(T),  which  are  replaced  by  a[, . . . ,  a'6  in  A(‘T/).  Figure  9.4 
illustrates  this.  We  call  the  edge  e  =  pjp~j  an  illegal  edge  if 

min  a,  <  min  a,. 

1<(<6  I 
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In  other  words,  an  edge  is  illegal  if  we  can  locally  increase  the  smallest  angle  Section  9.1 

by  flipping  that  edge.  The  following  observation  immediately  follows  from  the  triangulations  of  planar  point 

definition  of  an  illegal  edge.  sets 

Observation  9.3  Let  T  be  a  triangulation  with  an  illegal  edge  e.  Let  7'  be  the 
triangulation  obtained  from  7  by  flipping  e.  ThenA( 7')  >  A(T). 

It  turns  out  that  it  is  not  necessary  to  compute  the  angles  oq , . . . ,  OCg,  aj , . . . ,  od 
to  check  whether  a  given  edge  is  legal.  Instead,  we  can  use  the  simple  criterion 
stated  in  the  next  lemma.  The  correctness  of  this  criterion  follows  from  Thales’s 
Theorem. 


Lemma  9.4  Let  edge  pip]  be  incident  to  triangles  PiPjPk  and  PiPjPi,  and  let  C 
be  the  circle  through  p,,  pj,  and  /;/_  .  The  edge  pip]  is  illegal  if  and  only  if  the 
point  pi  lies  in  the  interior  of  C.  Furthermore,  if  the  points  pi,  pj,  pk,  pi  form 
a  convex  quadrilateral  and  do  not  lie  on  a  common  circle,  then  exactly  one  of 
PiPj  and  pkPi  is  an  illegal  edge. 

Observe  that  the  criterion  is  symmetric  in  pk  and  pp.  p\  lies  inside  the  circle 
through  p^  pj,  pk  if  and  only  if  pk  lies  inside  the  circle  through  /?,,  pj,  p /.  When 
all  four  points  lie  on  a  circle,  both  pip]  and  PkPi  are  legal.  Note  that  the  two 
triangles  incident  to  an  illegal  edge  must  form  a  convex  quadrilateral,  so  that  it 
is  always  possible  to  flip  an  illegal  edge. 

We  define  a  legal  triangulation  to  be  a  triangulation  that  does  not  contain 
any  illegal  edge.  From  the  observation  above  it  follows  that  any  angle-optimal 
triangulation  is  legal.  Computing  a  legal  triangulation  is  quite  simple,  once  we 
are  given  an  initial  triangulation.  We  simply  flip  illegal  edges  until  all  edges  are 
legal. 

Algorithm  LegalTriangulation(T) 

Input.  Some  triangulation  7  of  a  point  set  P. 

Output.  A  legal  triangulation  of  P. 

1 .  while  T  contains  an  illegal  edge  pip] 

2.  do  (*  Flip  p]p]  *) 

3.  Let  PiPjPk  and  PiPjPi  be  the  two  triangles  adjacent  to  pip]. 

4.  Remove  pip]  from  T,  and  add  pip]  instead. 

5.  return  T 

Why  does  this  algorithm  terminate?  It  follows  from  Observation  9.3  that  the 
angle-vector  of  7  increases  in  every  iteration  of  the  loop.  Since  there  is  only 
a  finite  number  of  different  triangulations  of  P,  this  proves  termination  of  the 
algorithm.  Once  it  terminates,  the  result  is  a  legal  triangulation.  Although  the 
algorithm  is  guaranteed  to  terminate,  it  is  too  slow  to  be  interesting.  We  have 
given  the  algorithm  anyway,  because  later  we  shall  need  a  similar  procedure. 
But  first  we  will  look  at  something  completely  different — or  so  it  seems. 
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Figure  9.5 
The  dual  graph  of  Vor (P) 


Figure  9.6 
The  Delaunay  graph  T>  9  (P) 


9.2  The  Delaunay  Triangulation 

Let  P  be  a  set  of  n  points — or  sites,  as  we  shall  sometimes  call  them — in  the 
plane.  Recall  from  Chapter  7  that  the  Voronoi  diagram  of  P  is  the  subdivision 
of  the  plane  into  n  regions,  one  for  each  site  in  P,  such  that  the  region  of  a 
site  p  £  P  contains  all  points  in  the  plane  for  which  p  is  the  closest  site.  The 
Voronoi  diagram  of  P  is  denoted  by  Vor(F).  The  region  of  a  site  p  is  called 


the  Voronoi  cell  of  p;  it  is  denoted  by  V(p).  In  this  section  we  will  study  the 
dual  graph  of  the  Voronoi  diagram.  This  graph  S  has  a  node  for  every  Voronoi 
cell — equivalently,  for  every  site — and  it  has  an  arc  between  two  nodes  if  the 
corresponding  cells  share  an  edge.  Note  that  this  means  that  9  has  an  arc  for 
every  edge  of  Vor(P).  As  you  can  see  in  Figure  9.5,  there  is  a  one-to-one 
correspondence  between  the  bounded  faces  of  9  and  the  vertices  of  Vor(P). 


Consider  the  straight-line  embedding  of  9,  where  the  node  corresponding 
to  the  Voronoi  cell  V(p)  is  the  point  p,  and  the  arc  connecting  the  nodes  of 
V(p)  and  V(q)  is  the  segment  pq — see  Figure  9.6.  We  call  this  embedding  the 
Delaunay  graph  of  P,  and  we  denote  it  by  DQ(P).  (Although  the  name  sounds 
French,  Delaunay  graphs  have  nothing  to  do  with  the  French  painter.  They 
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are  named  after  the  Russian  mathematician  Boris  Nikolaevich  Delone,  who  Section  9.2 

wrote  his  own  name  as  “Bopnc  HurcojiaeBMu  ZlejiOHe,”  which  would  be  the  delaunay  triangulation 

transliterated  into  English  as  “Delone.”  However,  since  his  work  was  published 

in  French — at  his  time,  the  languages  of  science  were  French  and  German — his 

name  is  better  known  in  the  French  transliteration.)  The  Delaunay  graph  of  a 

point  set  turns  out  to  have  a  number  of  surprising  properties.  The  first  is  that  it 

is  always  a  plane  graph:  no  two  edges  in  the  embedding  cross. 

Theorem  9.5  The  Delaunay  graph  of  a  planar  point  set  is  a  plane  graph. 


Proof.  To  prove  this,  we  need  a  property  of  the  edges  in  the  Voronoi  diagram 
stated  in  Theorem  7.4(h).  For  completeness  we  repeat  the  property,  phrased 
here  in  terms  of  Delaunay  graphs. 

The  edge  pip]  is  in  the  Delaunay  graph  CDS  (P)  if  and  only  if  there 
is  a  closed  disc  Qj  with  pj  and  pj  on  its  boundary  and  no  other  site 
of  P  contained  in  it.  (The  center  of  such  a  disc  lies  on  the  common 
edge  of  V(pi)  and  V(pj)i) 

Define  ttj  to  be  the  triangle  whose  vertices  are  /?,,  pj,  and  the  center  of  CJ; . 
Note  that  the  edge  of  tjj  connecting  pi  to  the  center  of  Qj  is  contained  in  V(pj); 
a  similar  observation  holds  for  pj.  Now  let  pipj  be  another  edge  of  ©S (P), 
and  define  the  circle  Cjy  and  the  triangle  ty  similar  to  the  way  Qj  and  tjj  were 
defined. 

Suppose  for  a  contradiction  that  pip]  and  pipj  intersect.  Both  pk  and  pi 
must  lie  outside  Qj  and  so  they  also  lie  outside  tjj.  This  implies  that  pjfpj  must 
intersect  one  of  the  edges  of  tjj  incident  to  the  center  of  Cjj.  Similarly,  pip] 
must  intersect  one  of  the  edges  of  Q  incident  to  the  center  of  Qj.  It  follows 
that  one  of  the  edges  of  tjj  incident  to  the  center  of  C,;-  must  intersect  one  of  the 
edges  of  tki  incident  to  the  center  of  Cfi .  But  this  contradicts  that  these  edges 
are  contained  in  disjoint  Voronoi  cells.  0 


The  Delaunay  graph  of  P  is  an  embedding  of  the  dual  graph  of  the  Voronoi 
diagram.  As  observed  earlier,  it  has  a  face  for  every  vertex  of  Vor(P).  The  edges 
around  a  face  correspond  to  the  Voronoi  edges  incident  to  the  corresponding 
Voronoi  vertex.  In  particular,  if  a  vertex  v  of  Vor(P)  is  a  vertex  of  the  Voronoi 
cells  for  the  sites  pi,P2,P3,  ■  ■  ■  jPk-  then  the  corresponding  face  /  in  ©S (P)  has 
Pi,P2,Pi,  ■  ■  ■  i Pk  ns  its  vertices.  Theorem  7.4(i)  tells  us  that  in  this  situation  the 
points  p  i ,  p2 ,  P3 , . . . ,  Pk  lie  on  a  circle  around  v,  so  we  not  only  know  that  /  is  a 
A'-gon,  but  even  that  it  is  convex. 

If  the  points  of  P  are  distributed  at  random,  the  chance  that  four  points 
happen  to  lie  on  a  circle  is  very  small.  We  will — in  this  chapter — say  that  a  set 
of  points  is  in  general  position  if  it  contains  no  four  points  on  a  circle.  If  P  is 
in  general  position,  then  all  vertices  of  the  Voronoi  diagram  have  degree  three, 
and  consequently  all  bounded  faces  of  ©S (P)  are  triangles.  This  explains  why 
©S  (P)  is  often  called  the  Delaunay  triangulation  of  P.  We  shall  be  a  bit  more 
careful,  and  will  call  ©S (P)  the  Delaunay  graph  of  P.  We  define  a  Delaunay 
triangulation  to  be  any  triangulation  obtained  by  adding  edges  to  the  Delaunay 
graph.  Since  all  faces  of  ©S (P)  are  convex,  obtaining  such  a  triangulation 
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Chapter  9  is  easy.  Observe  that  the  Delaunay  triangulation  of  P  is  unique  if  and  only  if 

delaunay  triangulations  D  S(P)  is  a  triangulation,  which  is  the  case  if  P  is  in  general  position. 
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We  now  rephrase  Theorem  7.4  about  Voronoi  diagrams  in  terms  of  Delaunay 
graphs. 

Theorem  9.6  Let  P  be  a  set  of  points  in  the  plane. 

(i)  Three  points  Pi,Pj,Pk  £  P  are  vertices  of  the  same  face  of  the  Delaunay 
graph  ofP  if  and  only  if  the  circle  through  p,,  p.,  pk  contains  no  point  of 
P  in  its  interior. 

(ii)  Two  points  Pi,Pj  £  P  form  an  edge  of  the  Delaunay  graph  of  P  if  and  only 
if  there  is  a  closed  disc  C  that  contains  /;,  and  pj  on  its  boundary  and  does 
not  contain  any  other  point  ofP. 

Theorem  9.6  readily  implies  the  following  characterization  of  Delaunay 
triangulations. 

Theorem  9.7  Let  P  be  a  set  of  points  in  the  plane,  and  let  T  be  a  triangulation 
ofP.  Then  T  is  a  Delaunay  triangulation  ofP  if  and  only  if  the  circumcircle  of 
any  triangle  of  T  does  not  contain  a  point  ofP  in  its  interior. 

Since  we  argued  before  that  a  triangulation  is  good  for  the  purpose  of  height 
interpolation  if  its  angle-vector  is  as  large  as  possible,  our  next  step  should  be 
to  look  at  the  angle-vector  of  Delaunay  triangulations.  We  do  this  by  a  slight 
detour  through  legal  triangulations. 

Theorem  9.8  Let  P  be  a  set  of  points  in  the  plane.  A  triangulation  T  of  P  is 
legal  if  and  only  if  T  is  a  Delaunay  triangulation  ofP. 

Proof.  It  follows  immediately  from  the  definitions  that  any  Delaunay  triangula¬ 
tion  is  legal. 

We  shall  prove  that  any  legal  triangulation  is  a  Delaunay  triangulation  by 
contradiction.  So  assume  T  is  a  legal  triangulation  of  P  that  is  not  a  Delaunay 
triangulation.  By  Theorem  9.6,  this  means  that  there  is  a  triangle  PiPjPk  such 
that  the  circumcircle  C(pipjpk)  contains  a  point  pi  £  P  in  its  interior.  Let 
e  :=  ~pip]  be  the  edge  of  pipjpi  such  that  the  triangle  PiPjPi  does  not  intersect 
PiPjPk ■  Of  all  such  pairs  {piPjPk, Pi)  in  T,  choose  the  one  that  maximizes  the 
angle  Lpipip  j.  Now  look  at  the  triangle  PiPjPm  adjacent  to  PiPjPk  along  e. 
Since  T  is  legal,  e  is  legal.  By  Lemma  9.4  this  implies  that  pm  does  not  lie  in  the 
interior  of  C (piPjPk)-  The  circumcircle  C(pipjpm)  of  PiPjP,,,  contains  the  part 
of  C(piPjPk)  that  is  separated  from  PiPjPk  by  e.  Consequently,  pi  £  C{pipjpm). 
Assume  that  pjpm  is  the  edge  of  PiPjPm  such  that  PjPmPi  does  not  intersect 
Pi  Pj  Pm  ■  But  now  fpjpip,„  >  fpipipj  by  Thales’s  Theorem,  contradicting  the 
definition  of  the  pair  {piPjPk,  Pi)-  0 

Since  any  angle-optimal  triangulation  must  be  legal.  Theorem  9.8  implies 
that  any  angle-optimal  triangulation  of  P  is  a  Delaunay  triangulation  of  P.  When 
P  is  in  general  position,  there  is  only  one  legal  triangulation,  which  is  then  the 
only  angle-optimal  triangulation,  namely  the  unique  Delaunay  triangulation 


that  coincides  with  the  Delaunay  graph.  When  P  is  not  in  general  position.  Section  9.3 

then  any  triangulation  of  the  Delaunay  graph  is  legal.  Not  all  these  Delaunay  computing  the  delaunay 

triangulations  need  to  be  angle-optimal.  However,  their  angle-vectors  do  not  triangulation 

differ  too  much.  Moreover,  using  Thales’s  Theorem  one  can  show  that  the 

minimum  angle  in  any  triangulation  of  a  set  of  co-circular  points  is  the  same, 

that  is,  the  minimum  angle  is  independent  of  the  triangulation.  This  implies  that 

any  triangulation  turning  the  Delaunay  graph  into  a  Delaunay  triangulation  has 

the  same  minimum  angle.  The  following  theorem  summarizes  this. 

Theorem  9.9  Let  P  be  a  set  of  points  in  the  plane.  Any  angle-optimal  trian¬ 
gulation  of  P  is  a  Delaunay  triangulation  of  P.  Furthermore,  any  Delaunay 
triangulation  ofP  maximizes  the  minimum  angle  over  all  triangulations  ofP. 


9.3  Computing  the  Delaunay  Triangulation 

We  have  seen  that  for  our  purpose — approximating  a  terrain  by  constructing  a 
polyhedral  terrain  from  a  set  P  of  sample  points — a  Delaunay  triangulation  of  P 
is  a  suitable  triangulation.  This  is  because  the  Delaunay  triangulation  maximizes 
the  minimum  angle.  So  how  do  we  compute  such  a  Delaunay  triangulation? 

We  already  know  from  Chapter  7  how  to  compute  the  Voronoi  diagram 
of  P.  From  Vor(P)  we  can  easily  obtain  the  Delaunay  graph  D S(P),  and  by 
triangulating  the  faces  with  more  than  three  vertices  we  can  obtain  a  Delaunay 
triangulation.  In  this  section  we  describe  a  different  approach:  we  will  compute 
a  Delaunay  triangulation  directly,  using  the  randomized  incremental  approach 
we  have  so  successfully  applied  to  the  linear  programming  problem  in  Chapter  4 
and  to  the  point  location  problem  in  Chapter  6. 

In  Chapter  6  we  found  it  convenient  to  start  with  a  large  rectangle  containing 
the  scene,  to  avoid  problems  caused  by  unbounded  trapezoids.  In  the  same 
spirit  we  now  start  with  a  large  triangle  that  contains  the  set  P.  We  will  add 
two  extra  points  p  \  and  p  _  2  that,  together  with  the  highest  point  po  of  P, 
form  a  triangle  containing  all  the  points.  This  means  we  are  now  computing  a 
Delaunay  triangulation  of  PU{p-i,p-2}  instead  of  the  Delaunay  triangulation 
of  P.  Later  we  want  to  obtain  the  Delaunay  triangulation  of  P  by  discarding  p  _  \ 
and  p_ 2,  together  with  all  incident  edges.  For  this  to  work  we  have  to  choose 
p- 1  and  p  -2  far  enough  away,  so  that  they  don’t  destroy  any  triangles  in  the 
Delaunay  triangulation  of  P.  In  particular,  we  must  ensure  they  do  not  lie  in 
any  circle  defined  by  three  points  in  P.  We  postpone  the  details  of  this  to  a  later 
stage;  first  we  have  a  look  at  the  algorithm. 

The  algorithm  is  randomized  incremental,  so  it  adds  the  points  in  random 
order  and  it  maintains  a  Delaunay  triangulation  of  the  current  point  set.  Consider 
the  addition  of  a  point  p,  .  We  first  find  the  triangle  of  the  current  triangulation 
that  contains  pr — how  this  is  done  will  be  explained  later — and  we  add  edges 
from  pr  to  the  vertices  of  this  triangle.  If  p,  happens  to  fall  on  an  edge  e  of 
the  triangulation,  we  have  to  add  edges  from  p,  to  the  opposite  vertices  in 
the  triangles  sharing  e.  Figure  9.7  illustrates  these  two  cases.  We  now  have 
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DELAUNAY  TRIANGULATIONS 


a  triangulation  again,  but  not  necessarily  a  Delaunay  triangulation.  This  is 
because  the  addition  of  pr  can  make  some  of  the  existing  edges  illegal.  To 


pr  lies  in  the  interior  of  a  triangle 


pr  falls  on  an  edge 


Figure  9. 7 

The  two  cases  when  adding  a  point  p, 


Pi 


remedy  this,  we  call  a  procedure  LegalizeEdge  with  each  potentially  illegal 
edge.  This  procedure  replaces  illegal  edges  by  legal  ones  through  edge  flips. 
Before  we  come  to  the  details  of  this,  we  give  a  precise  description  of  the  main 
algorithm.  It  will  be  convenient  for  the  analysis  to  let  P  be  a  set  of  n  +  1  points. 

Algorithm  DelaunayTriangulation(P) 

Input.  A  set  P  of  n  +  1  points  in  the  plane. 

Output.  A  Delaunay  triangulation  of  P. 

1 .  Let  po  be  the  lexicographically  highest  point  of  P.  that  is,  the  rightmost 
among  the  points  with  largest  y-coordinate. 

2.  Let  p- i  and  p- 2  be  two  points  in  M2  sufficiently  far  away  and  such  that  P 
is  contained  in  the  triangle  pop-ip-2- 

3.  Initialize  “X  as  the  triangulation  consisting  of  the  single  triangle  pop -1  P-2- 

4.  Compute  a  random  permutation  pi,p2,---,pn  ofP\{po}- 

5.  for  v  < —  1  to  n 

6.  do  (*  Insert  pr  into  T:  *) 

7.  Find  a  triangle  PiPjPk  €  X  containing  pr. 

8.  if  pr  lies  in  the  interior  of  the  triangle  PiPjPk 

9.  then  Add  edges  from  pr  to  the  three  vertices  of  PiPjPk ,  thereby 

splitting  PiPjPk  into  three  triangles. 

10.  LEGALIZEEDGEOr.piTTy,?) 

11.  LegalizeEdge^.pTpLT) 

12.  LEGALIZEEDGE(pr,pfp7,T) 

13.  else  (*  pr  lies  on  an  edge  of  PiPjPk ,  say  the  edge  pip]  *) 

14.  Add  edges  from  p,  to  pi  and  to  the  third  vertex  pi  of  the 
other  triangle  that  is  incident  to  pjp],  thereby  splitting  the 
two  triangles  incident  to  ~pjp]  into  four  triangles. 

15.  LEGALlZEEDGE(pr,pIpi,T) 

16.  LegalizeEdge(>,.,p77?7,‘X) 

17.  LEGALIZEEDGEOr.pTpLT) 

18.  LEGALlZEEDGE(pripkPi,T) 

19.  Discard  p-\  and  p  2  with  all  their  incident  edges  from  X 

20.  return  T 
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Next  we  discuss  the  details  of  turning  the  triangulation  we  get  after  line  9  (or 
line  14)  into  a  Delaunay  triangulation.  We  know  from  Theorem  9.8  that  a 
triangulation  is  a  Delaunay  triangulation  if  all  its  edges  are  legal.  In  the  spirit 
of  algorithm  Leg ALTri ANGULATION,  we  therefore  flip  illegal  edges  until  the 
triangulation  is  legal  again.  The  question  that  remains  is  which  edges  may 
become  illegal  due  to  the  insertion  of  pr.  Observe  that  an  edge  ~p[p~j  that  was 
legal  before  can  only  become  illegal  if  one  of  the  triangles  incident  to  it  has 
changed.  So  only  the  edges  of  the  new  triangles  need  to  be  checked.  This 
is  done  using  the  subroutine  LegalizeEdge,  which  tests  and  possibly  flips 
an  edge.  If  LegalizeEdge  flips  an  edge,  other  edges  may  become  illegal. 
Therefore  LegalizeEdge  calls  itself  recursively  with  such  potentially  illegal 
edges. 

LegalizeEdge(/?,-,  pip],  T> 

1 .  (*  The  point  being  inserted  is  pr,  and  /?,/?  ,  is  the  edge  of  T  that  may  need 
to  be  flipped.  *) 

2.  if  pipj  is  illegal 

3.  then  Let  PiPjPk  be  the  triangle  adjacent  to  PrPiPj  along  pip]. 

4.  (*  Flip  pjp]:  *)  Replace  pip]  with  prpk- 

5.  LegalizeEdge(p,.,/?7/?lT) 

6.  LegalizeEdge(  pnPkP],T) 

The  test  in  line  2  whether  an  edge  is  illegal  can  normally  be  done  by  applying 
Lemma  9.4.  There  are  some  complications  because  of  the  presence  of  the 
special  points  /?_ i  and  2-  We  shall  come  back  to  this  later;  first  we  prove  that 
the  algorithm  is  correct. 


Section  9.3 

COMPUTING  THE  DELAUNAY 
TRIANGULATION 


Figure  9.8 

All  edges  created  are  incident  to  /?,• 


To  ensure  the  correctness  of  the  algorithm,  we  need  to  prove  that  no  illegal 
edges  remain  after  all  calls  to  LegalizeEdge  have  been  processed.  From 
the  code  of  LegalizeEdge  it  is  clear  that  every  new  edge  created  due  to  the 
insertion  of  pr  is  incident  to  /?,-.  Figure  9.8  illustrates  this;  the  triangles  that 
are  destroyed  and  the  new  triangles  are  shown  in  grey.  The  crucial  observation 
(proved  below)  is  that  every  new  edge  must  be  legal,  so  there  is  no  need  to  test 
them.  Together  with  the  earlier  observation  that  an  edge  can  only  become  illegal 
if  one  of  its  incident  triangles  changes,  this  proves  that  the  algorithm  tests  any 
edge  that  may  become  illegal.  Hence,  the  algorithm  is  correct.  Note  that,  as  in 
Algorithm  LegalTriangulation,  the  algorithm  cannot  get  into  an  infinite 
loop,  because  every  flip  makes  the  angle-vector  of  the  triangulation  larger. 
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Chapter  9  Lemma  9.10  Every  new  edge  created  in  DELAUNAYTRIANGULATION  or  in 
delaunay  triangulations  LegalizeEdge  during  the  insertion  of  pr  is  an  edge  of  the  Delaunay  graph  of 

{p-2,P~l,P0,---,Pr}- 


Proof  Consider  first  the  edges  prpi,  prpj ,  PrPk  (and  perhaps  prpi)  created  by 
splitting  PiPjPk  (and  maybe  PiPjPi).  Since  PiPjPk  is  a  triangle  in  the  Delaunay 
triangulation  before  the  addition  of  pr,  the  circumcircle  C  of  PiPjPk  contains 
no  point  p,  with  t  <  r  in  its  interior.  By  shrinking  C  we  can  find  a  circle  C' 
through  pi  and  pr  contained  in  C.  Because  C'  C  C  we  know  that  C'  is  empty. 
This  implies  that  prpj  is  an  edge  of  the  Delaunay  graph  after  the  addition  of  pr. 
The  same  holds  for  prpj  and  prpk  (and  for  prpi,  if  it  exists). 

Now  consider  an  edge  flipped  by  LegalizeEdge.  Such  an  edge  flip 
always  replaces  an  edge  ppp ]  of  a  triangle  PiPjPi  by  an  edge  pTpj  incident  to 
pr.  Since  PiPjPi  was  a  Delaunay  triangle  before  the  addition  of  pr  and  because 
its  circumcircle  C  contains  pr — otherwise  pip]  would  not  be  illegal — we  can 
shrink  the  circumcircle  to  obtain  an  empty  circle  C'  with  only  p,  and  pi  on  its 
boundary.  Hence,  pfpj  is  an  edge  of  the  Delaunay  graph  after  the  addition.  0 

We  have  proved  the  correctness  of  the  algorithm.  What  remains  is  to  describe 
how  to  implement  two  important  steps:  how  to  find  the  triangle  containing  the 
point  p,  in  line  7  of  DelaunayTriangulation,  and  how  to  deal  correctly 
with  the  points  p-  \  and  p  -2  in  the  test  in  line  2  in  LegalizeEdge.  We  start 
with  the  former  issue. 


To  find  the  triangle  containing  pr  we  use  an  approach  quite  similar  to  what  we 
did  in  Chapter  6:  while  we  build  the  Delaunay  triangulation,  we  also  build  a 
point  location  structure  CD,  which  is  a  directed  acyclic  graph.  The  leaves  of 
CD  correspond  to  the  triangles  of  the  current  triangulation  CT,  and  we  maintain 
cross-pointers  between  those  leaves  and  the  triangulation.  The  internal  nodes  of 
CD  correspond  to  triangles  that  were  in  the  triangulation  at  some  earlier  stage, 
but  have  already  been  destroyed.  The  point  location  structure  is  built  as  follows. 
In  line  3  we  initialize  CD  as  a  DAG  with  a  single  leaf  node,  which  corresponds 
to  the  triangle  pop-\p~2- 

Now  suppose  that  at  some  point  we  split  a  triangle  PiPjPk  of  the  current 
triangulation  into  three  (or  two)  new  triangles.  The  corresponding  change  in 
CD  is  to  add  three  (or  two)  new  leaves  to  CD,  and  to  make  the  leaf  for  PiPjPk 
into  an  internal  node  with  outgoing  pointers  to  those  three  (or  two)  leaves. 
Similarly,  when  we  replace  two  triangles  PkPiPj  and  PiPjPi  by  triangles  pkPiPi 
and  PkPiPj  by  an  edge  flip,  we  create  leaves  for  the  two  new  triangles,  and  the 
nodes  of  PkPiPj  and  PiPjPi  get  pointers  to  the  two  new  leaves.  Figure  9.9  shows 
an  example  of  the  changes  in  D  caused  by  the  addition  of  a  point.  Observe 
that  when  we  make  a  leaf  into  an  internal  node,  it  gets  at  most  three  outgoing 
pointers. 

Using  D  we  can  locate  the  next  point  pr  to  be  added  in  the  current  triangu¬ 
lation.  This  is  done  as  follows.  We  start  at  the  root  of  D,  which  corresponds 

_  to  the  initial  triangle  poP-iP-2-  We  check  the  three  children  of  the  root  to  see 

202  in  which  triangle  pr  lies,  and  we  descend  to  the  corresponding  child.  We  then 


Chapter  9  check  the  children  of  this  node,  descend  to  a  child  whose  triangle  contains  pr, 

delaunay  triangulations  and  so  on,  until  we  reach  a  leaf  of  T>.  This  leaf  corresponds  to  a  triangle  in  the 

current  triangulation  that  contains  pr.  Since  the  out-degree  of  any  node  is  at 
most  three,  this  takes  linear  time  in  the  number  of  nodes  on  the  search  path,  or, 
in  other  words,  in  the  number  of  triangles  stored  in  D  that  contain  pr. 

There  is  only  one  detail  left,  namely  how  to  choose  p  .  \  and  /?_ 2,  and  how  to 
implement  the  test  of  whether  an  edge  is  legal.  On  the  one  hand,  we  have  to 
choose  p_  1  and  p  2  to  be  far  away,  because  we  don’t  want  their  presence  to 
influence  the  Delaunay  triangulation  of  P.  One  the  other  hand,  we  don’t  want  to 
introduce  the  huge  coordinates  needed  for  that.  So  what  we  do  is  to  treat  these 
points  symbolically:  we  do  not  actually  assign  coordinates  to  them,  but  instead 
modify  the  tests  for  point  location  and  for  illegal  edges  such  that  they  work  as  if 
we  had  chosen  the  points  to  be  very  far  away. 

In  the  following,  we  will  say  that  p  =  ( xp,yp )  is  higher  than  q  =  (, xq,yq ) 
if  y p  >  yq  or  yp  =  yq  and  xq  >  xp,  and  use  the  (lexicographic)  ordering  on  P 
induced  by  this  relation. 

Let  i- 1  be  a  horizontal  line  lying  below  the  entire  set  P,  and  let  (-2  be  a 
horizontal  line  lying  above  P.  Conceptually,  we  choose  p  \  to  lie  on  the  line 
t-\  sufficiently  far  to  the  right  that  p_\  lies  outside  every  circle  defined  by  three 
non-collinear  points  of  P,  and  such  that  the  clockwise  ordering  of  the  points  of 
P  around  p  \  is  identical  to  their  (lexicographic)  ordering.  Next,  we  choose 
p  2  to  lie  on  the  line  i- 2  sufficiently  far  to  the  left  that  p  2  lies  outside  every 
circle  defined  by  three  non-collinear  points  of  P  U  {p  \ },  and  such  that  the 
counterclockwise  ordering  of  the  points  of  P  U  {p~  1 }  around  p~2  is  identical  to 
their  (lexicographic)  ordering. 

The  Delaunay  triangulation  of  PL)  {p-i,p  2}  consists  of  the  Delaunay 
triangulation  of  P,  edges  connecting  p-  \  to  every  point  on  the  right  convex  hull 
of  P,  edges  connecting  p  2  to  every  point  on  the  left  convex  hull  of  P,  and 
the  one  edge  ~p  \p  2-  The  lowest  point  of  P  and  the  highest  point  po  of  P  are 
connected  to  both  p  _  \  and  p  2- 

During  the  point  location  step,  we  need  to  determine  the  position  of  a 
point  pj  with  respect  to  the  oriented  line  from  /;,  to  p/,.  By  our  choice  of  p-  \ 
and  p  -2,  the  following  conditions  are  equivalent: 

■  pj  lies  to  the  left  of  the  line  from  /?,  to  p  \ ; 

■  p  j  lies  to  the  left  of  the  line  from  p  2  to  pp 
m  pj  is  lexicographically  larger  than  /?,  . 

It  remains  to  explain  how  to  treat  p-  1  and  p  2  when  we  check  whether  an 
edge  is  illegal.  Let  "pip]  be  the  edge  to  be  tested,  and  let  pj  and  p/  be  the  other 
vertices  of  the  triangles  incident  to  ppp]  (if  they  exist). 

■  PiPj  is  an  edge  of  the  triangle  pop-  \P-2-  These  edges  are  always  legal. 

■  The  indices  i,  j.k. I  are  all  non-negative.  This  is  the  normal  case;  none  of 
the  points  involved  in  the  test  is  treated  symbolically.  Hence,  ~pjp]  is  illegal 
if  and  only  if  pi  lies  inside  the  circle  defined  by  />,,  pj,  and  /?/.. 

■  All  other  cases.  In  this  case,  pip]  is  legal  if  and  only  if  min(k,/)  <  min(i,y). 
Only  the  last  case  requires  further  justification.  Since  the  situation  where 

PJP~i  is  P-iP-2  is  handled  in  the  first  case,  at  most  one  of  the  indices  i  and  j 
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9.4  The  Analysis 

We  first  look  at  the  structural  change  generated  by  the  algorithm.  This  is  the 
number  of  triangles  created  and  deleted  during  the  course  of  the  algorithm. 
Before  we  start  the  analysis,  we  introduce  some  notation:  Pr  :=  {p\, . . . . p,-} 
and  l>Sr  :=  X>S({p-2,P-i,Po}  UPr). 

Lemma  9.11  The  expected  number  of  triangles  created  by  algorithm  Delau- 
nayTriangulation  is  at  most  9n  +  1. 

Proof.  In  the  beginning,  we  create  the  single  triangle  poP-\P-2-  In  iteration  r 
of  the  algorithm,  when  we  insert  pr ,  we  first  split  one  or  two  triangles,  creating 
three  or  four  new  triangles.  This  splitting  creates  the  same  number  of  edges  in 
1) namely  prpj,  p,-Pk  (and  maybe  prpf).  Furthermore,  for  every  edge 
that  we  flip  in  procedure  LegalizeEdge,  we  create  two  new  triangles.  Again, 
the  flipping  creates  an  edge  of  D9r  incident  to  pr.  To  summarize:  if  after  the 
insertion  of  pr  there  are  k  edges  of  DQr  incident  to  pr.  then  we  have  created 
at  most  2(k  —  3)  +  3  =  2&  —  3  new  triangles.  The  number  k  is  the  degree  of  pr 
in  T>Sp  we  denote  this  degree  by  deg(pr,“DSr).  degree  of  pr,  over  all  possible 
permutations  of  the  set  PI  As  in  Chapter  4  and  6  we  use  backwards  analysis  to 
bound  this  value.  So,  for  the  moment,  we  fix  the  set  Pr.  We  want  to  bound  the 
expected  degree  of  the  point  pr,  which  is  a  random  element  of  the  set  Pr.  By 
Theorem  7.3,  the  Delaunay  graph  T>Sr  has  at  most  3(r  +  3)  —  6  edges.  Three  of 
these  are  the  edges  of  pop-  1P-2,  and  therefore  the  total  degree  of  the  vertices 
in  Pr  is  less  than  2[3(r  +  3)  —  9]  =  6r.  This  means  that  the  expected  degree  of 
a  random  point  of  Pr  is  at  most  6.  Summarizing  the  above,  we  can  bound  the 
number  of  triangles  created  in  step  r  as  follows. 

E  [number  of  triangles  created  in  step  r  <  E[2deg(/>r,D3r)-3] 

=  2E[deg(pr,©Sr)]-3 

<  2-6-3  =  9 

The  total  number  of  created  triangles  is  one  for  the  triangle  pop-\p-2  that  we 
start  with,  plus  the  number  of  triangles  created  in  each  of  the  insertion  steps. 
Using  linearity  of  expectation,  we  get  that  the  expected  total  number  of  created 
triangles  is  bounded  by  1  +  9«.  0 


is  negative.  On  the  other  hand,  either  p^_  or  pi  is  the  point  p,  that  we  have  just 
inserted,  and  so  at  most  one  of  the  indices  k  and  /  is  negative. 

If  only  one  of  the  four  indices  is  negative,  then  this  point  lies  outside  the 
circle  defined  by  the  other  three  points,  and  the  method  is  correct. 

Otherwise,  both  min(i,/)  and  minf/c. / )  are  negative,  and  the  fact  that  p  i 
lies  outside  any  circle  defined  by  three  points  in  PU{p_i}  implies  that  the 
method  is  correct. 


We  now  state  the  main  result. 
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Theorem  9.12  The  Delaunay  triangulation  of  a  set  P  of  n  points  in  the  plane 
can  be  computed  in  O(nlogn)  expected  time,  using  0(n)  expected  storage. 

Proof.  The  correctness  of  the  algorithm  follows  from  the  discussion  above.  As 
for  the  storage  requirement,  we  note  that  only  the  search  structure  CD  could  use 
more  than  linear  storage.  However,  every  node  of  CD  corresponds  to  a  triangle 
created  by  the  algorithm,  and  by  the  previous  lemma  the  expected  number  of 
these  is  0(n). 

To  bound  the  expected  running  time  we  first  ignore  the  time  spent  in  the 
point  location  step  (line  7).  Now  the  time  spent  by  the  algorithm  is  proportional 
to  the  number  of  created  triangles.  From  the  previous  lemma  we  can  therefore 
conclude  that  the  expected  running  time,  not  counting  the  time  for  point  location, 
is  0{n). 

It  remains  to  account  for  the  point  location  steps.  The  time  to  locate  the 
point  pr  in  the  current  triangulation  is  linear  in  the  number  of  nodes  of  CD  that  we 
visit.  Any  visited  node  corresponds  to  a  triangle  that  was  created  at  some  earlier 
stage  and  that  contains  pr.  If  we  count  the  triangle  of  the  current  triangulation 
separately,  then  the  time  for  locating  pr  is  0(1)  plus  linear  time  in  the  number 
of  triangles  that  were  present  at  some  earlier  stage,  but  have  been  destroyed, 
and  contain  pr. 

A  triangle  PiPjPk  can  be  destroyed  from  the  triangulation  for  one  of  two 
reasons: 

■  A  new  point  pi  has  been  inserted  inside  (or  on  the  boundary  of)  PiPjPk,  and 

PiPjPk  was  split  into  three  (or  two)  subtriangles. 

■  An  edge  flip  has  replaced  PiP/Pk  and  an  adjacent  triangle  PiPjPi  by  the  pair 

PkPiPl  and  pkPjPi- 

In  the  first  case,  the  triangle  PiPjPk  was  a  Delaunay  triangle  before  /?/  was 
inserted.  In  the  second  case,  either  PiPjPk  was  a  Delaunay  triangle  and  p /  was 
inserted,  or  pipjpi  was  a  Delaunay  triangle  and  /?/.  was  inserted.  If  PiPjPi  was 
the  Delaunay  triangle,  then  the  fact  that  the  edge  Jpp]  was  flipped  means  that 
both  pk  and  p,  lie  inside  the  circumcircle  of  PiPjPi- 

In  all  cases  we  can  charge  the  fact  that  triangle  PiPjPk  was  visited  to  a 
Delaunay  triangle  A  that  has  been  destroyed  in  the  same  stage  as  PiPjPk,  and 
such  that  the  circumcircle  of  A  contains  pr.  Denote  the  subset  of  points  in  P 
that  lie  in  the  circumcircle  of  a  given  triangle  A  by  K{ A).  In  the  argument  above 
the  visit  to  a  triangle  during  the  location  of  pr  is  charged  to  a  triangle  A  with 
pr  G  K(A).  It  is  easy  to  see  that  a  triangle  A  can  be  charged  at  most  once  for 
every  one  of  the  points  in  /G(A).  Therefore  the  total  time  for  the  point  location 
steps  is 

0(n  +  £card(*:(A))),  (9.1) 

A 

where  the  summation  is  over  all  Delaunay  triangles  A  created  by  the  algorithm. 
We  shall  prove  later  that  the  expected  value  of  this  sum  is  0(n  log/;).  This 
proves  the  theorem.  ED 

It  remains  to  bound  the  expected  size  of  the  sets  7G(A).  If  A  is  a  triangle  of 
the  Delaunay  triangulation  T>Sr,  then  what  would  we  expect  card(7G(A))  to  be? 


For  r  =  1  we  would  expect  it  to  be  roughly  n,  and  for  r  =  n  we  know  that  it  is 
zero.  What  happens  in  between?  The  nice  thing  about  randomization  is  that  it 
“interpolates”  between  those  two  extremes.  The  right  intuition  would  be  that, 
since  Pr  is  a  random  sample,  the  number  of  points  lying  inside  the  circumcircle 
of  a  triangle  A  £  T>Sr  is  about  0(n/r).  But  be  warned:  this  is  not  really  true  for 
all  triangles  in  CDS,..  Nevertheless,  the  sum  in  expression  (9.1)  behaves  as  if  it 
were  true. 

In  the  remainder  of  this  section  we  will  give  a  quick  proof  of  this  fact  for 
the  case  of  a  point  set  in  general  position.  The  result  is  true  for  the  general  case 
as  well,  but  to  see  that  we  have  to  work  a  little  bit  harder,  so  we  postpone  that  to 
the  next  section,  where  we  treat  the  problem  in  more  generality. 

Lemma  9.13  IfP  is  a  point  set  in  general  position,  then 

Ecard(W(A))  =  O(nlogn), 

A 

where  the  summation  is  over  all  Delaunay  triangles  A  created  by  the  algorithm. 

Proof.  Since  P  is  in  general  position,  every  subset  P,  is  in  general  position.  This 
implies  that  the  triangulation  after  adding  the  point  pr  is  the  unique  triangulation 
DSr-  We  denote  the  set  of  triangles  of  CDS,  by  CT, .  Now  the  set  of  Delaunay 
triangles  created  in  stage  r  equals  7r  \  7r- 1  by  definition.  Hence,  we  can  rewrite 
the  sum  we  want  to  bound  as 

E  (  E  card(A-(A)) 

r=  1  VAeTr\tTr_! 

For  a  point  q,  let  k(Pr,q )  denote  the  number  of  triangles  A  £  T,.  such  that 
q  £  K( A),  and  let  k(Priq,pr)  be  the  number  of  triangles  A  £  T,-  such  that  not 
only  q  £  K( A)  but  for  which  we  also  have  that  pr  is  incident  to  A.  Recall  that 
any  Delaunay  triangle  created  in  stage  r  is  incident  to  pr,  so  we  have 

E  card(W(A))  =  E  k(Pr,q,pr )•  (9.2) 

AGTr\trr_i  qeP\Pr 

For  the  moment,  we  fix  Pr.  In  other  words,  we  consider  all  expectations  to  be 
over  the  set  of  permutations  of  the  set  P  where  P,  is  equal  to  a  fixed  set  P* .  The 
value  of  k(Pr,q,pr)  then  depends  only  on  the  choice  of  pr.  Since  a  triangle 
A  €  T,-  is  incident  to  a  random  point  p  £  P*  with  probability  at  most  3 /r,  we  get 

E [k(Pr,q,pr)\  <  3k(P^. 

If  we  sum  this  over  all  q  £  P  \  P,  and  use  (9.2),  we  get 

E[  E  card(W(A))]  ^  -  E  k(Pnq).  (9-3) 

AGTr\0-r_!  r  qeP\Pr 


Every  q  £  P\Pr  is  equally  likely  to  appear  as  pr+ 1,  and  so  we  have 


1 


E  k(Pr^)' 

qeP\Pr 
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E[k{Pr,pr+l) 


n  —  r 
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E[  E  card(W(A))]  <3(  —  )E[k(Pr,pr+l)]. 

AGTr\Tr_i  Y 

What  is  k(Pripr+\)l  It  is  the  number  of  triangles  A  of  7r  that  have  pr+\  £  K{ A). 
By  the  criterion  from  Theorem  9.6  (i),  these  triangles  are  exactly  the  triangles 
of Tr  that  will  be  destroyed  by  the  insertion  of  pr+  \ .  Hence,  we  can  rewrite  the 
previous  expression  as 

E[  E  card(W(A))]  o(— )  E[card(T,.  \  T+1)] . 

AGTr\Tr_!  r 

Theorem  9. 1  shows  that  the  number  of  triangles  in  7m  is  precisely  2 (in  +  3)  — 
2  —  3  =  2m  +  1 .  Therefore,  the  number  of  triangles  destroyed  by  the  insertion 
of  point  pr+ 1  is  exactly  two  less  than  the  number  of  triangles  created  by  the 
insertion  of  pr+ 1,  and  we  can  rewrite  the  sum  as 

E[  E  card(W(A))]  <3(— )(E[card(T,+i\Tr)]-2). 

AGTr\Tr_i  W 

Until  now  we  considered  Pr  to  be  fixed.  At  this  point,  we  can  simply  take  the 
average  over  all  choices  of  Pr  C  P  on  both  sides  of  the  inequality  above,  and 
find  that  it  also  holds  if  we  consider  the  expectation  to  be  over  all  possible 
permutations  of  the  set  P. 

We  already  know  that  the  number  of  triangles  created  by  the  insertion  of 
pr+i  is  identical  to  the  number  of  edges  incident  to  pr+\  in  '.T,- ,  | ,  and  that  the 
expected  number  of  these  edges  is  at  most  6.  We  conclude  that 

E[  E  card(/T(A))]  <12(— ). 

AGTr\Tr_i  r 

Summing  over  r  proves  the  lemma.  Li 


9.5*  A  Framework  for  Randomized  Algorithms 

Up  to  now  we  have  seen  three  randomized  incremental  algorithms  in  this  book: 
one  for  linear  programming  in  Chapter  4,  one  for  computing  a  trapezoidal  map 
in  Chapter  6,  and  one  for  computing  a  Delaunay  triangulation  in  this  chapter. 
(We  will  see  one  more  in  Chapter  11.)  These  algorithms,  and  most  other 
randomized  incremental  algorithms  in  the  computational  geometry  literature, 
all  work  according  to  the  following  principle. 

Suppose  the  problem  is  to  compute  some  geometric  structure  T(A),  defined 
by  a  set  X  of  geometric  objects.  (For  instance,  a  Delaunay  triangulation  de¬ 
fined  by  a  set  of  points  in  the  plane.)  A  randomized  incremental  algorithm 
does  this  by  adding  the  objects  in  X  in  random  order,  meanwhile  maintaining 
the  structure  ‘T.  To  add  the  next  object,  the  algorithm  first  finds  out  where 
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the  current  structure  has  to  be  changed  because  there  is  a  conflict  with  the  Section  9.5* 
object — the  location  step — and  then  it  updates  the  structure  locally — the  update  a  framework  for  randomized 

step.  Because  all  randomized  incremental  algorithms  are  so  much  alike,  their  algorithms 
analyses  are  quite  similar  as  well.  To  avoid  having  to  prove  the  same  bounds 
over  and  over  again  for  different  problems,  an  axiomatic  framework  has  been 
developed  that  captures  the  essence  of  randomized  incremental  algorithms.  This 
framework — called  a  configuration  space — can  be  used  to  prove  ready-to-use 
bounds  for  the  expected  running  time  of  many  randomized  incremental  algo¬ 
rithms.  (Unfortunately,  the  term  “configuration  space’’  is  also  used  in  motion 
planning,  where  it  means  something  completely  different — see  Chapter  13.)  In 
this  section  we  describe  this  framework,  and  we  give  a  theorem  that  can  be  used 
to  analyze  any  randomized  incremental  algorithm  that  fits  into  the  framework. 

For  instance,  the  theorem  can  immediately  be  applied  to  prove  Lemma  9.13, 
this  time  without  assuming  that  P  has  to  be  in  general  position. 


A  configuration  space  is  defined  to  be  a  four-tuple  (A,  IT.  I).  K).  Here  X  is  the 
input  to  the  problem,  which  is  a  finite  set  of  (geometric)  objects',  we  denote  the 
cardinality  of  A  by  n.  The  set  n  is  a  set  whose  elements  are  called  configurations. 
Finally,  D  and  K  both  assign  to  every  configuration  A  €  II  a  subset  of  X,  denoted 
D(A)  and  K( A)  repectively.  Elements  of  the  set  D( A)  are  said  to  define  the 
configuration  A,  and  the  elements  of  the  set  K( A)  are  said  to  be  in  conflict  with, 
or  to  kill,  A.  The  number  of  elements  of  K(A)  is  called  the  conflict  size  of  the 
configuration  A.  We  require  that  (A,n,Z), K)  satisfies  the  following  conditions. 

■  The  number  d  :=  max{card(Z)(A))  |  A  £  IT}  is  a  constant.  We  call  this 
number  the  maximum  degree  of  the  configuration  space.  Moreover,  the 
number  of  configurations  sharing  the  same  defining  set  should  be  bounded 
by  a  constant. 

■  We  have  D{ A)  n  K(A)  =  0  for  all  configurations  A  €  II. 

A  configuration  A  is  called  active  over  a  subset  S  C  X  if  Z)( A)  is  contained  in  S 
and  K( A)  is  disjoint  from  S.  We  denote  the  set  of  configurations  active  over  S 
by  T(S),  so  we  have 

T(S):={Aen  :  D(A)  C  S  and  K(A)  DS  =  0}. 

The  active  configurations  form  the  structure  we  want  to  compute.  More  precisely, 
the  goal  is  to  compute  ‘X(A).  Before  we  continue  our  discussion  of  this  abstract 
framework,  let’s  see  how  the  geometric  structures  we  have  met  so  far  fit  in. 

Half-plane  intersection.  In  this  case  the  input  set  A  is  a  set  of  half-planes 
in  the  plane.  We  want  to  define  II,  D,  and  K  in  such  a  way  that  T(A)  is  what 
we  want  to  compute,  namely  the  intersection  of  the  half-planes  in  A.  We 
can  achieve  this  as  follows.  The  set  II  of  configurations  consists  of  all  the 
intersection  points  of  the  lines  bounding  the  half-planes  in  A.  The  defining  set 
Z)( A)  of  a  configuration  A  £  II  consists  of  the  two  lines  defining  the  intersection, 
and  the  killing  set  K( A)  consists  of  all  half-planes  that  do  not  contain  the 
intersection  point.  Hence,  for  any  subset  S  C  A,  and  in  particular  for  A  itself. 
T (.S' j  is  the  set  of  vertices  of  the  common  intersection  of  the  half-planes  in  S. 
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Chapter  9  Trapezoidal  maps.  Here  the  input  set  X  is  a  set  of  segments  in  the  plane.  The 
delaunay  triangulations  set  II  of  configurations  contains  all  trapezoids  appearing  in  the  trapezoidal  map 

of  any  S  Cl.  The  defining  set  /9(A)  of  a  configuration  A  is  the  set  of  segments 
that  are  necessary  to  define  A.  The  killing  set  K( A)  of  a  trapezoid  A  is  the  set 
of  segments  that  intersect  A.  With  these  definitions,  7(S)  is  exactly  the  set  of 
trapezoids  of  the  trapezoidal  map  of  S. 

Delaunay  Triangulation.  The  input  set  X  is  a  set  of  points  in  general  position 
in  the  plane.  The  set  II  of  configurations  consists  of  triangles  formed  by  three 
(non-collinear)  points  in  X.  The  defining  set  D(A)  consists  of  the  points  that 
form  the  vertices  of  A,  and  the  killing  set  K( A)  is  the  set  of  points  lying  inside 
the  circumcircle  of  A.  By  Theorem  9.6,  T(S)  is  exactly  the  set  of  triangles  of 
the  unique  Delaunay  triangulation  of  S. 

As  stated  earlier,  the  goal  is  to  compute  the  structure  T(X).  Randomized  incre¬ 
mental  algorithms  do  this  by  computing  a  random  permutation  x\,X2,  ■  ■  ■  ■  xn  of 
the  objects  in  X  and  then  adding  the  objects  in  this  order,  meanwhile  maintaining 
7(Xr),  where  Xr  :=  {xi,X2,  ■  ■  ■  ,xr}.  The  fundamental  property  of  configuration 
spaces  that  makes  this  possible  is  that  we  can  decide  whether  or  not  a  config¬ 
uration  A  appears  in  7(Xr)  by  looking  at  it  locally — we  only  need  to  look  for 
the  defining  and  killing  objects  of  A.  In  particular,  7(Xr)  does  not  depend  on 
the  order  in  which  the  objects  in  Xr  were  added.  For  instance,  a  triangle  A  is  in 
the  Delaunay  triangulation  of  5  if  and  only  if  the  vertices  of  A  are  in  5,  and  no 
point  of  S  lies  in  the  circumcircle  of  A. 

The  first  thing  we  usually  did  when  we  analyzed  a  randomized  incremental 
algorithm  was  to  prove  a  bound  on  the  expected  structural  change — see  for 
instance  Lemma  9.11.  The  next  theorem  does  the  same,  but  now  in  the  abstract 
configuration-space  framework. 

Theorem  9.14  Let  (X  .Tl.I).  K)  be  a  configuration  space,  and  let  7  and  Xr  be 
defined  as  above.  Then  the  expected  number  of  configurations  in  7(Xr)  \ 
T(Xr_i)  is  at  most 

-E\  card(T(Xr))], 

where  d  is  the  maximum  degree  of  the  conhguration  space. 

Proof.  As  in  previous  occasions  where  we  wanted  to  bound  the  structural 
change,  we  use  backwards  analysis:  instead  of  trying  to  argue  about  the  number 
of  configurations  that  appear  due  to  the  addition  of  xr  into  Xr-\,  we  shall  argue 
about  the  number  of  configurations  that  disappear  when  we  remove  xr  from  Xr. 
To  this  end  we  temporarily  let  Xr  be  some  fixed  subset  X*  C  X  of  cardinality  r. 
We  now  want  to  bound  the  expected  number  of  configurations  A  £  7(Xr)  that 
disappear  when  we  remove  a  random  object  xr  from  Xr.  By  definition  of  7,  such 
a  configuration  A  must  have  xr  £  D( A).  Since  there  are  at  most  d  ■  card(T(W)) 
pairs  (x,  A)  with  A  £  7(Xr)  and  x  £  D( A),  we  have 

£  card({A  £  7(Xr)  |  x  £  D(A)})  <  d ■  card(T(Xr)). 
xexr 
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This  theorem  gives  a  generic  bound  for  the  expected  size  of  the  structural 
change  during  a  randomized  incremental  algorithm.  But  what  about  the  cost  of 
the  location  steps?  In  many  cases  we  will  need  a  bound  of  the  same  form  as  in 
this  chapter,  namely  we  need  to  bound 

£card(Z(A)), 

A 

where  the  summation  is  over  all  configurations  A  that  are  created  by  the  algo¬ 
rithm,  that  is,  all  configurations  that  appear  in  one  of  the  7(Xr).  This  bound  is 
given  in  the  following  theorem. 

Theorem  9.15  Let  (X.  Id.  I).  K)  be  a  configuration  space,  and  let  T  and  Xr  be 
defined  as  above.  Then  the  expected  value  of 

£card(£(A)), 

A 

where  the  summation  is  over  all  configurations  A  appearing  in  at  least  one  T(Ar) 
with  1  ^  r  ^  n,  is  at  most 

f  i2(n~  r)(E[card(T(Xf))] 

r—  1  1  1 

where  d  is  the  maximum  degree  of  the  configuration  space. 

Proof.  We  can  follow  the  proof  of  Lemma  9. 13  quite  closely.  We  first  rewrite 
the  sum  as 

E  (  E  card(W(A)) 

r=  1  \AeTr\trr_i 

Next,  let  k(Xr,y)  denote  the  number  of  configurations  A  £  7(Xr)  such  that 
y  £  K( A),  and  let  k(Xr,y,xr)  be  the  number  of  configurations  A  £  7(Xr)  such  that 
not  only  y  £  /f(A)  but  for  which  we  also  have  xr  £  D{ A).  Any  new  configuration 
appearing  due  to  the  addition  of  xr  must  have  xr  £  D{ A).  This  implies  that 

E  card(W(A))  =  E  K^r,y,xr).  (9.4) 

AeTr\Tr_i  yex\xr 

We  now  fix  the  set  Xr.  The  expected  value  of  k(Xr,y,xr)  then  depends  only  on 
the  choice  of  xr  £  X,  .  Since  the  probability  that  y  £  D{ A)  for  a  configuration 
A  £  7(Xr)  is  at  most  d/r,  we  have 


Hence,  the  expected  number  of  configurations  disappearing  due  to  the  removal 
of  a  random  object  from  Xr  is  at  most  j  card(T(A,-)).  In  this  argument,  the  set 
Xr  was  a  fixed  subset  X*  C  X  of  cardinality  r.  To  obtain  the  general  bound, 
we  have  to  average  over  all  possible  subsets  of  size  r,  which  gives  a  bound  of 
^E[card(T(W))].  0 


E  [k(Xr,y,xr)\  < 


dk(Xr,y) 


r 


211 


Chapter  9 

DELAUNAY  TRIANGULATIONS 


212 


If  we  sum  this  over  all  y  £  X  \Xr  and  use  (9.4),  we  get 

E[  £  card(A'(A))]  ^  —  £  k(Xr,y).  (9.5) 

AeTr\Tr_!  r  yex\xr 

On  the  other  hand,  every  y  £  X  \  Xr  is  equally  likely  to  appear  as  xr+\,  so 

E[k(Ar,xr+i)]  =  — —  £  k(Xr,y). 

n  r  yex\xr 

Substituting  this  into  (9.5)  gives 

E[  £  caid(tf(A))]  )E[k(Xr,x,+i)]. 

Aetrr\Tr_i  r 

Now  observe  that  k(Xr,xr+ 1)  is  the  number  of  configurations  A  of  T(Xr)  that 
will  be  destroyed  in  the  next  stage,  when  xr+\  is  inserted.  This  means  we  can 
rewrite  the  last  expression  as 

E[  £  card(tf(A))]  ^d(— )  E[card(T(X,) \T(Zr+i))] .  (9.6) 

AGTr\Tr_!  r 

Unlike  in  the  proof  of  Lemma  9.13,  however,  we  cannot  simply  bound  the  num¬ 
ber  of  configurations  destroyed  in  stage  r  +  1  by  the  number  of  configurations 
created  at  that  stage,  because  that  need  not  be  true  in  a  general  configuration 
space.  Hence,  we  proceed  somewhat  differently. 

First  we  observe  that  we  can  take  the  average  over  all  choices  of  Xr  on  both 
sides  of  (9.6)  and  find  that  it  also  holds  if  the  expectation  is  over  all  permutations 
of  X.  Next,  we  sum  over  all  r,  and  rewrite  the  sum  as  follows: 

t  d("Ip)  carilW)  VX(X,+l))  =  •  (9.7) 

where  the  summation  on  the  right  hand  side  is  over  all  configurations  A  that 
are  created  and  later  destroyed  by  the  algorithm,  and  where  /  (A)  denotes  the 
stage  when  configuration  A  is  destroyed.  Let  i( A)  denote  the  stage  when  the 
configuration  A  is  created.  Since  /(A)  ^  j( A)  —  1,  we  have 

n  —  [ /(A)  —  1]  n  n  n  —  i(  A) 

,-(a)-i  =  = 

If  we  substitute  this  into  (9.7),  we  see  that 

r=  1  v  1  y  A  V  y 

The  right  hand  side  of  this  expression  is  at  most 

£  d(— )  card  (T (Xr)  \  T (Xr- 1 ) ) 

r=  1  ^ 


(the  difference  being  only  those  configurations  that  are  created  but  never  de¬ 
stroyed)  and  so  we  have 

E[£  £  card^A))]  <  £ E[card(TW \T(X-0)] • 

r=lAGTr\Tr_1  r=  1  ' 

By  Theorem  9.14,  we  get  the  bound  we  wanted  to  prove: 

E[£  E  card^A))]  <  Ed(^-4)^E[card(T(Xr))].  □ 

r=lAeTr\W-l  r=  1  '  ' 

This  finishes  the  analysis  in  the  abstract  setting.  As  an  example,  we  will 
show  how  to  apply  the  results  to  our  randomized  incremental  algorithm  for 
computing  the  Delaunay  triangulation.  In  particular,  we  will  prove  that 

£card(^T(A))  =  O(n\ogn), 

A 

where  the  summation  is  over  all  triangles  A  created  by  the  algorithm,  and  where 
K (A)  is  the  set  of  points  in  the  circumcircle  of  the  triangle. 

Unfortunately,  it  seems  impossible  to  properly  define  a  configuration  space 
whose  configurations  are  triangles  when  the  points  are  not  in  general  position. 
Therefore  we  shall  choose  the  configurations  slightly  differently. 

Let  P  be  a  set  of  points  in  the  plane,  not  necessarily  in  general  position.  Let  Q  := 
{po,p-i,p-2j  denote  the  set  of  three  points  we  used  to  start  the  construction. 
Recall  that  p o  is  the  lexicographically  largest  point  from  P,  while  points  p  _  \ 
and  p-2  were  chosen  such  that  they  do  not  destroy  any  Delaunay  edges  between 
points  inP.  WesetX  :=  P\{po}.  Every  triple  A  =  (pi,Pj,Pk)  of  points  inXUD 
that  do  not  lie  on  a  line  defines  a  configuration  with  D( A)  :=  {pi-Pj-Pk}  OX 
and  K{ A)  is  the  set  of  points  of  X  that  lie  either  in  the  interior  of  the  circumcircle 
of  the  triangle  PiPjPk  or  on  the  circular  arc  on  the  circumcircle  from  p,  to  pk 
containing  pj.  We  call  such  a  configuration  A  a  Delaunay  corner  of  X,  because 
A  is  active  over  S  C  X  if  and  only  if  p,-,  pj,  and  pk  are  consecutive  points  on  the 
boundary  of  one  face  of  the  Delaunay  graph  D  5  (Q  U  .S') .  Note  that  any  set  of 
three  non-collinear  points  defines  three  different  configurations. 

The  important  observation  is  that  whenever  DelaunayTriangulation 
creates  a  new  triangle,  this  triangle  is  of  the  form  PiPrPj ,  where  p,  is  the 
point  inserted  in  this  stage,  and  prp\  and  prpj  are  edges  of  the  Delaunay  graph 
DS(£iUP,) — see  Lemma  9.10.  It  follows  that  when  the  triangle  p,p,-p/  is 
created,  the  triple  ( pj,pr,Pi )  is  a  Delaunay  corner  of  D (-] (Q  LJ  /) )  and.  hence, 
it  is  an  active  configuration  over  the  set  Pr.  The  set  K( A)  defined  for  this 
configuration  contains  all  points  contained  in  the  circumcircle  of  the  triangle 
PiPrPj.  We  can  therefore  bound  the  original  sum  by 

£card(W(A)), 

A 
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■  points  in  K( A) 

□  points  not  in  K( A) 


where  the  sum  is  over  all  Delaunay  comers  A  that  appear  in  some  intermediate 
Delaunay  graph  DS(£XUPr). 
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Now  Theorem  9.15  applies.  How  many  Delaunay  corners  are  there  in  the 
Delaunay  graph  of  S  U  fl?  The  worst  case  is  when  the  Delaunay  graph  is  a 
triangulation.  If  S  contains  r  points,  then  the  triangulation  has  2(r  +  3)  —  5 
triangles,  and  therefore  6(r+  3)  —  15  =  6r+  3  Delaunay  corners.  It  follows 
from  Theorem  9. 15  that 

^  card  (A- (A))  <  Yj9(~ — -)  (— — <  54n  £  “  <  54n(lnn+ 1). 
a  r=i  v  r  v  r  r=i  r 

This  finally  completes  the  proof  of  Theorem  9.12. 


9.6  Notes  and  Comments 

The  problem  of  triangulating  a  set  of  points  is  a  topic  in  computational  geometry 
that  is  well  known  outside  this  field.  Triangulations  of  point  sets  in  two  and  more 
dimensions  are  of  paramount  importance  in  numerical  analysis,  for  instance 
for  finite  element  methods,  but  also  in  computer  graphics.  In  this  chapter  we 
looked  at  the  case  of  triangulations  that  only  use  the  given  points  as  vertices.  If 
additional  points — so-called  Steiner  points — are  allowed,  the  problem  is  also 
known  as  meshing  and  is  treated  in  more  detail  in  Chapter  14. 

Lawson  [244]  proved  that  any  two  triangulations  of  a  planar  point  set  can  be 
transformed  into  each  other  by  flipping  edges.  He  later  suggested  finding  a  good 
triangulation  by  iteratively  flipping  edges,  where  such  an  edge-flip  improves 
some  cost  function  of  the  triangulation  [245], 

It  had  been  observed  for  some  time  that  triangulations  that  lead  to  good 
interpolations  avoid  long  and  skinny  triangles  [38],  The  result  that  there  is — if 
we  ignore  degenerate  cases — only  one  locally  optimal  triangulation  with  respect 
to  the  angle-vector,  namely  the  Delaunay  triangulation,  is  due  to  Sibson  [360]. 

Looking  only  at  the  angle-vector  completely  ignores  the  height  of  the  data 
points,  and  is  therefore  also  called  the  data-independent  approach.  A  good 
motivation  for  this  approach  is  given  by  Rippa  [328],  who  proves  that  the  De¬ 
launay  triangulation  is  the  triangulation  that  minimizes  the  roughness  of  the 
resulting  terrain,  no  matter  what  the  actual  height  data  is.  Here,  roughness  is 
defined  as  the  integral  of  the  square  of  the  L?-norm  of  the  gradient  of  the  terrain. 
More  recent  research  tries  to  find  improved  triangulations  by  taking  the  height 
information  into  account.  This  data-dependent  approach  was  first  proposed  by 
Dyn  et  al.  [154],  who  suggest  different  cost  criteria  for  triangulations,  which 
depend  on  the  height  of  the  data  points.  Interestingly,  they  compute  their  im¬ 
proved  triangulations  by  starting  with  the  Delaunay  triangulation  and  iteratively 
flipping  edges.  The  same  approach  is  taken  by  Quak  and  Schumaker  [325],  who 
consider  piecewise  cubic  interpolation,  and  Brown  [76].  Quak  and  Schumaker 
observe  that  their  triangulations  are  small  improvements  compared  to  the  Delau¬ 
nay  triangulation  when  they  try  to  approximate  smooth  surfaces,  but  that  they 
can  be  drastically  different  for  non-smooth  surfaces. 

More  references  relevant  to  Delaunay  triangulations  as  the  dual  of  Voronoi 
diagrams  can  be  found  in  Chapter  7. 


The  randomized  incremental  algorithm  we  have  given  here  is  due  to  Guibas 
et  al.  [196],  but  our  analysis  of  £Acard(W(A))  is  from  Mulmuley’s  book  [290]. 
The  argument  that  extends  the  analysis  to  the  case  of  points  in  degenerate 
position  is  new.  Alternative  randomized  algorithms  were  given  by  Boissonnat 
et  al.  [69,  71],  and  by  Clarkson  and  Shor  [133], 

Various  geometric  graphs  defined  on  a  set  of  points  P  have  been  found  to  be 
subgraphs  of  the  Delaunay  triangulation  of  P.  The  most  important  one  is  proba¬ 
bly  the  Euclidean  minimum  spanning  tree  (EMST)  of  the  set  of  points  [349]; 
others  are  the  Gabriel  graph  [186]  and  the  relative  neighborhood  graph  [374]. 
We  treat  these  geometric  graphs  in  the  exercises. 

Another  important  triangulation  is  the  minimum  weight  triangulation ,  that 
is,  a  triangulation  whose  weight  is  minimal  (where  the  weight  of  a  triangulation 
is  the  sum  of  the  lengths  of  all  edges  of  the  triangulation)  [12,  42,  146,  147]. 
Determining  a  minimum  weight  triangulation  among  all  triangulations  of  a 
given  point  set  was  recently  shown  to  be  NP-complete  [291], 


9.7  Exercises 


9.1  In  this  exercise  we  look  at  the  number  of  different  triangulations  that  a 
set  of  n  points  in  the  plane  may  allow. 

a.  Prove  that  no  set  of  n  points  can  be  triangulated  in  more  than  2^) 
ways. 

b.  Prove  that  there  are  sets  of  n  points  that  can  be  triangulated  in  at  least 
2«-2V»  different  ways. 


9.2  The  degree  of  a  point  in  a  triangulation  is  the  number  of  edges  incident  to 
it.  Give  an  example  of  a  set  of  n  points  in  the  plane  such  that,  no  matter 
how  the  set  is  triangulated,  there  is  always  a  point  whose  degree  is  n  —  1. 

9.3  Prove  that  any  two  triangulations  of  a  planar  point  set  can  be  transformed 
into  each  other  by  edge  flips.  Hint:  Show  first  that  any  two  triangulations 
of  a  convex  polygon  can  be  transformed  into  each  other  by  edge  flips. 

9.4  Prove  that  the  smallest  angle  of  any  triangulation  of  a  convex  polygon 
whose  vertices  lie  on  a  circle  is  the  same.  This  implies  that  any  completion 
of  the  Delaunay  triangulation  of  a  set  of  points  maximizes  the  minimum 
angle. 


9.5 


a.  Given  four  points  p,  q,  r,  s  in  the  plane,  prove  that  point  s  lies  in  the 
interior  of  the  circle  through  p,  q,  and  r  if  and  only  if  the  following 
condition  holds.  Assume  that  p.  q.  r  form  the  vertices  of  a  triangle  in 
clockwise  order. 
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b.  The  determinant  test  of  part  a.  can  be  used  to  test  if  an  edge  in  a 
triangulation  is  legal.  Can  you  come  up  with  an  alternative  way  to 
implement  this  test?  Discuss  the  advantages  and/or  disadvantages  of 
your  method  compared  to  the  determinant  test. 

9.6  We  have  described  algorithm  DelaunayTriangulation  by  calling 
a  recursive  procedure  LegalizeEdge.  Give  an  iterative  version  of 
this  procedure,  and  discuss  the  advantages  and/or  disadvantages  of  your 
procedure  over  the  recursive  one. 

9.7  Prove  that  all  edges  of  rD<p(Pr)  that  are  not  in  DS(Pr-i)  are  incident  to 
pr.  In  other  words,  the  new  edges  of  X>S(P,-)  form  a  star  as  in  Figure  9.8. 
Give  a  direct  proof,  without  referring  to  algorithm  DELAUNAYTRIANGU¬ 
LATION. 

9.8  Let  P  be  a  set  of  n  points  in  general  position,  and  let  q  ^  P  be  a  point 
inside  the  convex  hull  of  P.  Let  Pi-Pj-Pk  be  the  vertices  of  a  triangle 
in  the  Delaunay  triangulation  of  P  that  contains  q.  (Since  q  can  lie  on 
an  edge  of  the  Delaunay  triangulation,  there  can  be  two  such  triangles.) 
Prove  that  ([pi,  qp],  and  qpk  are  edges  of  the  Delaunay  triangulation  of 
/'  •'{'/! • 

9.9  The  algorithm  given  in  this  chapter  is  randomized,  and  it  computes  the 
Delaunay  triangulation  of  a  set  of  n  points  in  0(n  log/;)  expected  time. 
Show  that  the  worst-case  running  time  of  the  algorithm  is  Q.(n2). 

9.10  The  algorithm  given  in  this  chapter  uses  two  extra  points  p  _  \  and  p  i  to 
start  the  construction  of  the  Delaunay  triangulation.  These  points  should 
not  lie  in  any  circle  defined  by  three  input  points,  and  so  far  away  that 
they  see  the  points  of  P  in  their  lexicographic  order.  These  conditions 
were  enforced  by  implementing  operations  involving  these  points  in  a 
special  way — see  page  204.  Compute  explicit  coordinates  for  the  extra 
points  such  that  this  special  implementation  is  not  needed.  Is  this  a  better 
approach? 

9.1 1  A  Euclidean  minimum  spanning  tree  (EMST)  of  a  set  P  of  points  in  the 
plane  is  a  tree  of  minimum  total  edge  length  connecting  all  the  points. 
EMST’s  are  interesting  in  applications  where  we  want  to  connect  sites 
in  a  planar  environment  by  communication  lines  (local  area  networks), 
roads,  railroads,  or  the  like. 

a.  Prove  that  the  set  of  edges  of  a  Delaunay  triangulation  of  P  contains 
an  EMST  for  P. 

b.  Use  this  result  to  give  an  0{n  log«)  algorithm  to  compute  an  EMST 
for  P. 

9.12  The  traveling  salesman  problem  (TSP)  is  to  compute  a  shortest  tour 
visiting  all  points  in  a  given  point  set.  The  traveling  salesman  problem  is 
NP-hard.  Show  how  to  find  a  tour  whose  length  is  at  most  two  times  the 
optimal  length,  using  the  EMST  defined  in  the  previous  exercise. 


9.13  The  Gabriel  graph  of  a  set  P  of  points  in  the  plane  is  defined  as  follows: 
Two  points  p  and  q  are  connected  by  an  edge  of  the  Gabriel  graph  if  and 
only  if  the  disc  with  diameter  pq  does  not  contain  any  other  point  of  P. 

a.  Prove  that  DS (P)  contains  the  Gabriel  graph  of  P. 

b.  Prove  that  p  and  q  are  adjacent  in  the  Gabriel  graph  of  P  if  and  only  if 
the  Delaunay  edge  between  p  and  q  intersects  its  dual  Voronoi  edge. 

c.  Give  an  O(n\ogn)  time  algorithm  to  compute  the  Gabriel  graph  of  a 
set  of  n  points. 

9.14  The  relative  neighborhood  graph  of  a  set  P  of  points  in  the  plane  is 
defined  as  follows:  Two  points  p  and  q  are  connected  by  an  edge  of  the 
relative  neighborhood  graph  if  and  only  if 
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d{p,q )  <  min  max(d(p,r)1d(q,r)). 

reP,r^p,q 

a.  Given  two  points  p  and  q ,  let  htneip.q)  be  the  moon-shaped  region 
formed  as  the  intersection  of  the  two  circles  around  p  and  q  whose 
radius  is  d(p,q).  Prove  that  p  and  q  are  connected  in  the  relative 
neighborhood  graph  if  and  only  if  lune(p,q)  does  not  contain  any 
point  of  P  in  its  interior. 

b.  Prove  that  X>g(P)  contains  the  relative  neighborhood  graph  of  P. 

c.  Design  an  algorithm  to  compute  the  relative  neighborhood  graph  of  a 
given  point  set. 

9.15  Prove  the  following  relationship  between  the  edge  sets  of  an  EMST,  of 
the  relative  neighborhood  graph  (RNG),  the  Gabriel  graph  (GG),  and  the 
Delaunay  graph  CDS)  of  a  point  set  P. 


EMST  C  RNG  C  GG  C  2)3 . 


(See  the  previous  exercises  for  the  definition  of  these  graphs.) 

9.16  A  ^-clustering  of  a  set  P  of  n  points  in  the  plane  is  a  partitioning  of  P 
into  k  non-empty  subsets  Pi , . . .  ,Pp .  Define  the  distance  between  any  pair 
Pi,Pj  of  clusters  to  be  the  minimum  distance  between  one  point  from  P, 
and  one  point  from  Pj,  that  is. 


dist (Pi,Pj)  :=  min  dist(/?,g). 

pePi,qePj 

We  want  to  find  a  ^-clustering  (for  given  k  and  P)  that  maximizes  the 

minimum  distance  between  clusters. 

a.  Suppose  the  mimimum  distance  between  clusters  is  achieved  by  points 
p  €  Pj  and  qGPj.  Prove  that  ~pq  is  an  edge  of  the  Delaunay  triangulation 
ofP. 

b.  Give  an  G(nlogn)  time  algorithm  to  compute  a  ^-clustering  maximiz¬ 
ing  the  minimum  distance  between  clusters.  Hint:  Use  a  Union-Find 
data  structure. 
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DELAUNAY  TRIANGULATIONS 


9.17  The  weight  of  a  triangulation  is  the  sum  of  the  lengths  of  all  edges  of 
the  triangulation.  A  minimum  weight  triangulation  is  a  triangulation 
whose  weight  is  minimal.  Disprove  the  conjecture  that  the  Delaunay 
triangulation  is  a  minimum  weight  triangulation. 

9.18*  Give  an  example  of  a  geometric  configuration  space  (X  .W.D.K)  where 
T(Xr)  \  T(Xr+i)  can  be  arbitrarily  large  compared  to  T (Xr+  \ )  \  T(A,  ). 

9.19*  Apply  configuration  spaces  to  analyze  the  randomized  incremental  algo¬ 
rithm  of  Chapter  6. 
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10  More  Geometric  Data  Structures 

Windowing 


In  the  future  most  cars  will  be  equipped  with  a  vehicle  navigation  system  to 
help  you  determine  your  position  and  to  guide  you  to  your  destination.  Such 
a  system  stores  a  roadmap  of,  say,  the  whole  of  the  U.S.  It  also  keeps  track  of 
where  you  are,  so  that  it  can  show  the  appropriate  part  of  the  roadmap  at  any 
time  on  a  little  computer  screen;  this  will  usually  be  a  rectangular  region  around 
your  present  position.  Sometimes  the  system  will  do  even  more  for  you.  For 
example,  it  might  warn  you  when  a  turn  is  coming  up  that  you  should  take  to 
get  to  your  destination. 


Figure  10.1 

A  windowing  query  in  a  map  of  the  U.S. 


To  be  of  any  use,  the  map  should  contain  sufficient  detail.  A  detailed  map 
of  the  whole  of  Europe  contains  an  enormous  amount  of  data.  Fortunately,  only 
a  small  part  of  the  map  has  to  be  displayed.  Nevertheless,  the  system  still  has  to 
find  that  part  of  the  map:  given  a  rectangular  region,  or  a  window ,  the  system 
must  determine  the  part  of  the  map  (roads,  cities,  and  so  on)  that  lie  in  the 
window,  and  display  them.  This  is  called  a  windowing  query. 

Checking  every  single  feature  of  the  map  to  see  if  it  lies  inside  the  window 
is  not  a  workable  method  with  the  amount  of  data  that  we  are  dealing  with. 
What  we  should  do  is  to  store  the  map  in  some  kind  of  data  structure  that  allows 
us  to  retrieve  the  part  inside  a  window  quickly. 


Windowing  queries  are  not  only  useful  operations  on  geographic  maps.  They 
also  play  an  important  role  in  several  applications  in  computer  graphics  and 
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Chapter  10  CAD/CAM.  One  example  is  flight  simulation.  A  model  of  a  landscape  can 

more  geometric  data  structures  consist  of  a  huge  number  of  triangles,  but  only  a  small  part  of  the  landscape 

will  be  within  sight  of  the  pilot.  So  we  have  to  select  the  part  of  the  landscape 
that  lies  within  a  given  region.  Here  the  region  is  3-dimensional,  and  it  is  called 
the  viewing  volume.  Another  example  comes  from  the  design  of  printed  circuit 
boards.  Such  a  design  typically  consists  of  (a  number  of  layers  of)  a  planar 
drawing  showing  the  location  of  traces  and  components.  (See  also  Chapter  14.) 
In  the  design  process  one  often  wants  to  zoom  in  onto  a  certain  portion  of  the 
board  to  inspect  it  more  closely.  Again,  what  we  need  is  to  determine  the  traces 
and  components  on  the  board  that  lie  inside  the  window.  In  fact,  windowing 
is  required  whenever  one  wants  to  inspect  a  small  portion  of  a  large,  complex 
object. 

Windowing  queries  are  similar  to  the  range  queries  studied  in  Chapter  5.  The 
difference  is  the  type  of  data  that  is  dealt  with:  the  data  for  range  queries  are 
points,  whereas  the  data  for  windowing  queries  are  typically  line  segments, 
polygons,  curves,  and  so  on.  Also,  for  range  queries  we  often  deal  with  higher¬ 
dimensional  search  spaces,  while  for  windowing  queries  the  search  space  usually 
is  2-  or  3 -dimensional. 


10.1  Interval  Trees 


Let’s  start  with  the  easiest  of  the  examples  that  we  gave  above,  namely  window¬ 
ing  for  a  printed  circuit  board.  The  reason  that  this  example  is  easier  than  the 
others  is  that  the  type  of  data  is  restricted:  the  objects  on  a  printed  circuit  board 
normally  have  boundaries  that  consist  of  line  segments  with  a  limited  number 
of  possible  orientations.  Often  they  are  parallel  to  one  of  the  sides  of  the  board 
or  make  45  degree  angles  with  the  sides.  Here  we  only  consider  the  case  were 
the  segments  are  parallel  to  the  sides.  In  other  words,  if  we  consider  the  x-axis 
to  be  aligned  with  the  bottom  side  of  the  board,  and  the  y-axis  to  be  aligned 
with  the  left  side  of  the  board,  then  any  segment  is  parallel  to  either  the  x-axis 
or  the  y-axis:  the  segments  are  axis-parallel ,  or  orthogonal.  We  assume  that  the 
query  window  is  an  axis-parallel  rectangle,  that  is,  a  rectangle  whose  edges  are 
axis-parallel. 

Let  S  be  a  set  of  n  axis-parallel  line  segments.  To  solve  windowing  queries  we 
need  a  data  structure  that  stores  S  in  such  a  way  that  the  segments  intersecting  a 
query  window  W  :=  [x :  x']  x  [y  :  y']  can  be  reported  efficiently.  Let’s  first  see  in 
what  ways  a  segment  can  intersect  the  window.  There  are  a  number  of  different 
cases:  the  segment  can  lie  entirely  inside  W,  it  can  intersect  the  boundary  of 
W  once,  it  can  intersect  the  boundary  twice,  or  it  can  (partially)  overlap  the 
boundary  of  W.  In  most  cases  the  segment  has  at  least  one  endpoint  inside  W. 
We  can  find  such  segments  by  performing  a  range  query  with  W  in  the  set  of  2 n 
endpoints  of  the  segments  in  5.  In  Chapter  5  we  have  seen  a  data  structure  for 
this:  the  range  tree.  A  2-dimensional  range  tree  uses  O(n\ogn)  storage,  and  a 
range  query  can  be  answered  in  O(log2n  +k)  time,  where  k  is  the  number  of 
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reported  points.  We  have  also  shown  that  we  can  apply  fractional  cascading  to 
reduce  the  query  time  to  O(\ogn  +  k).  There  is  one  little  problem.  If  we  do  a 
range  query  with  W  in  the  set  of  segment  endpoints,  we  report  the  segments  that 
have  both  endpoints  inside  W  twice.  This  can  be  avoided  by  marking  a  segment 
when  we  report  it  for  the  first  time,  and  only  reporting  segments  that  are  not  yet 
marked.  Alternatively,  when  we  find  an  endpoint  of  a  segment  to  lie  inside  W, 
we  can  check  whether  the  other  endpoint  lies  inside  W  as  well.  If  not  we  report 
the  segment.  If  the  other  endpoint  does  lie  inside  W,  we  only  report  the  segment 
when  the  current  endpoint  is  the  leftmost  or  bottom  endpoint.  This  leads  to  the 
following  lemma. 

Lemma  10.1  Let  S  be  a  set  of  n  axis-parallel  line  segments  in  the  plane.  The 
segments  that  have  at  least  one  endpoint  inside  an  axis-parallel  query  window 
W  can  be  reported  in  O(logn  +  k)  time  with  a  data  structure  that  uses  O(n\ogn) 
storage  and  preprocessing  time,  where  k  is  the  number  of  reported  segments. 

It  remains  to  find  the  segments  that  do  not  have  an  endpoint  inside  the  query 
window.  Such  segments  either  cross  the  boundary  of  W  twice  or  contain  one 
edge  of  the  boundary.  When  the  segment  is  vertical  it  will  cross  both  horizontal 
edges  of  the  boundary.  When  it  is  horizontal  it  will  cross  both  vertical  edges. 
Hence,  we  can  find  such  segments  by  reporting  all  segments  that  intersect  the 
left  edge  of  the  boundary  and  all  segments  that  intersect  the  bottom  edge  of  the 
boundary.  (Note  that  there  is  no  need  to  query  with  the  other  two  edges  of  the 
boundary.)  To  be  precise,  we  should  only  report  those  segments  that  do  not  have 
an  endpoint  inside  W,  because  the  others  were  already  reported  before.  Let’s 
consider  the  problem  of  finding  the  horizontal  segments  intersected  by  the  left 
edge  of  W;  to  deal  with  the  top  edge  we  just  have  to  reverse  the  roles  of  the  x- 
and  ^-coordinates. 

We  have  arrived  at  the  following  problem:  preprocess  a  set  S  of  horizontal 
line  segments  in  the  plane  such  that  the  segments  intersecting  a  vertical  query 
segment  can  be  reported  efficiently.  To  gain  some  insight  into  the  problem  we 
first  look  at  a  simpler  version,  namely  where  the  query  segment  is  a  full  line. 
Let  i  :=  (x  =  qx)  denote  the  query  line.  A  horizontal  segment  s  :=  (x,y){x! ,y) 
is  intersected  by  t  if  and  only  if  x  L  qx  x' .  So  only  the  x-coordi  nates  of  the 
segments  play  a  role  here.  In  other  words,  the  problem  becomes  1 -dimensional: 
given  a  set  of  intervals  on  the  real  line,  report  the  ones  that  contain  the  query 
point  qx. 

Let  I  :=  {[jci  :  xj],  [x2  :  x^], . . . ,  [xn  :  x'n}}  be  a  set  of  closed  intervals  on  the 
real  line.  To  keep  the  connection  with  the  2-dimensional  problem  alive  we 
image  the  real  line  to  be  horizontal,  and  we  say  “to  the  left  (right)  of”  instead  of 
“less  (greater)  than”.  Let  xmKi  be  the  median  of  the  2 n  interval  endpoints.  So  at 
most  half  of  the  interval  endpoints  lies  to  the  left  of  xmid  and  at  most  half  of  the 
endpoints  lies  to  the  right  of  xml(|.  If  the  query  value  qx  lies  to  the  left  of  xmjd 
then  the  intervals  that  lie  completely  to  the  right  of  xmid  obviously  do  not  contain 
qx.  We  construct  a  binary  tree  based  on  this  idea.  The  right  subtree  of  the  tree 
stores  the  set  /right  of  the  intervals  lying  completely  to  the  right  of  Xmid,  and  the 
left  subtree  stores  the  set  7ieft  of  intervals  completely  to  the  left  of  xm;d.  These 
subtrees  are  constructed  recursively  in  the  same  way.  There  is  one  problem  that 
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Figure  10.2 

Classification  of  the  segments  with 
respect  to  Amid 
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we  still  have  to  deal  with:  what  to  do  with  the  intervals  that  contain  Xmjj?  One 
possibility  would  be  to  store  such  intervals  in  both  subtrees.  For  the  children  of 
the  node,  however,  the  same  thing  could  happen  again.  Eventually  one  interval 
could  be  stored  many  times,  and  the  amount  of  storage  our  data  structure  uses 
could  become  very  large.  To  avoid  the  proliferation  of  intervals  we  deal  with 
the  problem  differently:  we  store  the  set  7mid  of  intervals  containing  xm;d  in  a 
separate  structure  and  associate  that  structure  with  the  root  of  our  tree.  See 
Figure  10.2  for  the  situation.  Note  that  in  this  figure  (and  others),  although 
the  intervals  lie  on  a  real  line,  we  draw  them  on  slightly  different  heights  to 
distinguish  them. 

The  associated  structure  should  enable  us  to  report  the  intervals  in  7mid  that 
contain  qx.  So  we  ended  up  with  the  same  problem  that  we  started  with:  given 
a  set  /nli(|  of  intervals,  find  those  that  contain  qx.  But  if  we  have  bad  luck  llm(\ 
could  be  the  same  as  I.  It  seems  we  are  back  to  exactly  the  same  problem,  but 
there  is  a  difference.  We  know  that  all  the  intervals  in  Ima  contain  xrrMCi,  and  this 
helps  a  great  deal.  Suppose,  for  example,  that  qx  lies  to  the  left  of  xmid.  In  that 
case  we  already  know  that  the  right  endpoint  of  all  intervals  in  7mid  lies  right 
of  qx.  So  only  the  left  endpoints  of  the  intervals  are  important:  qx  is  contained 
in  an  interval  \xj  :  x'-\  £  /mid  if  and  only  if  Xj  ^  qx.  If  we  store  the  intervals  in 
a  list  ordered  on  increasing  left  endpoint,  then  qx  can  only  be  contained  in  an 
interval  if  qx  is  also  contained  in  all  its  predecessors  in  the  sorted  list.  In  other 
words,  we  can  simply  walk  along  the  sorted  list  reporting  intervals,  until  we 
come  to  an  interval  that  does  not  contain  qx.  At  that  point  we  can  stop:  none 
of  the  remaining  intervals  can  contain  qx.  Similarly,  when  qx  lies  right  of  xmKi 
we  can  walk  along  a  list  that  stores  the  intervals  sorted  on  right  endpoint.  This 
list  must  be  sorted  on  decreasing  right  endpoint,  because  it  is  traversed  if  the 
query  point  qx  lies  to  the  right  of  xmid.  Finally,  when  qx  =  xmid  we  can  report  all 
intervals  in  7m y.  (We  do  not  need  to  treat  this  as  a  separate  case.  We  can  simply 
walk  along  one  of  the  sorted  lists.) 


We  now  give  a  succinct  description  of  the  whole  data  structure  that  stores  the 
intervals  in  7.  The  data  structure  is  called  an  interval  tree.  Figure  10.3  shows  an 
interval  tree;  the  dotted  vertical  segments  indicate  the  values  xmjd  for  each  node. 
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If  7  =  0  then  the  interval  tree  is  a  leaf. 


An  interval  tree 


■  Otherwise,  let  AmKi  be  the  median  of  the  endpoints  of  the  intervals.  Let 

/left  -=  {['V j  '■  Xj\  £  l  '.  Xj  <  Amid}, 

/mid  •  {  [x j  •  Xj]  £  I  .  Xj  ^  Amid  ^  X j}, 

/right  •  {  [x j  •  X  j]  £  I  .  Amid  A  A  j}. 

The  interval  tree  consists  of  a  root  node  v  storing  Amid-  Furthermore, 

■  the  set  /mjd  is  stored  twice;  once  in  a  list  Lieft(v)  that  is  sorted  on  the 
left  endpoints  of  the  intervals,  and  once  in  a  list  Lright(v)  that  is  sorted 
on  the  right  endpoints  of  the  intervals, 

■  the  left  subtree  of  v  is  an  interval  tree  for  the  set  4*, 

■  the  right  subtree  of  v  is  an  interval  tree  for  the  set  /right  - 

Lemma  10.2  An  interval  tree  on  a  set  of  n  intervals  uses  0(n )  storage  and  has 
depth  O(\ogn). 

Proof.  The  bound  on  the  depth  is  trivial,  so  we  prove  the  storage  bound.  Note 
that  /ieft,  /mid.  and  /right  are  disjoint  subsets.  As  a  result,  each  interval  is  only 
stored  in  a  set  /m id  once  and.  hence,  only  appears  once  in  the  two  sorted  lists. 
This  shows  that  the  total  amount  of  storage  required  for  all  associated  lists  is 
bounded  by  0(n).  The  tree  itself  uses  0(n)  storage  as  well.  ED 

The  following  recursive  algorithm  for  building  an  interval  tree  follows  directly 
from  its  definition.  (Recall  that  lc(v )  and  rc( v)  denote  the  left  and  right  child, 
respectively,  of  a  node  v.) 

Algorithm  ConstructIntervalTree(/) 

Input.  A  set  I  of  intervals  on  the  real  line. 

Output.  The  root  of  an  interval  tree  for  /. 

1.  if  /  =  0 

2.  then  return  an  empty  leaf 

3.  else  Create  a  node  v.  Compute  Amid.  the  median  of  the  set  of  interval 

endpoints,  and  store  Amid  with  v. 
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6. 
7. 


Compute  7mid  and  construct  two  sorted  lists  for  a  list  £.|eft(v) 

sorted  on  left  endpoint  and  a  list  £right(v)  sorted  on  right  endpoint. 

Store  these  two  lists  at  v. 

lc(v)  <—  CONSTRUCTlNTERVALTREE(/ieft) 

rc(v)  <—  CONSTRUCTlNTERVALTREE(/right) 

return  v 


Finding  the  median  of  a  set  of  points  takes  linear  time.  Actually,  it  is  better  to 
compute  the  median  by  presorting  the  set  of  points,  as  in  Chapter  5.  It  is  easy  to 
maintain  these  presorted  sets  through  the  recursive  calls.  Let  :=  card(/m;d). 
Creating  the  lists  Cieft(v)  and  £rjght(v)  takes  <9(nmidl0gHmid)  time.  Hence, 
the  time  we  spend  (not  counting  the  time  needed  for  the  recursive  calls)  is 
0(n  +  «mid  lognmjd).  Using  similar  arguments  as  in  the  proof  of  Lemma  10.2 
we  can  conclude  that  the  algorithm  runs  in  0{n\ogn)  time. 

Lemma  10.3  An  interval  tree  on  a  set  of  n  intervals  can  be  built  in  0(n  log/;) 
time. 


It  remains  to  show  how  to  use  the  interval  tree  to  find  the  intervals  containing 
a  query  point  qx.  We  already  sketched  how  to  do  this,  but  now  we  can  give  the 
exact  algorithm. 

Algorithm  QueryIntervalTree(v,  qx) 

Input.  The  root  v  of  an  interval  tree  and  a  query  point  qx. 

Output.  All  intervals  that  contain  qx. 

1 .  if  v  is  not  a  leaf 

2.  then  if  qx  <  xmici(v) 

3.  then  Walk  along  the  list  L|ett(v),  starting  at  the  interval  with  the 

leftmost  endpoint,  reporting  all  the  intervals  that  contain  qx. 
Stop  as  soon  as  an  interval  does  not  contain  qx. 

4.  QueryIntervalTree(/c(v),^v) 

5.  else  Walk  along  the  list  Cright(v),  starting  at  the  interval  with  the 

rightmost  endpoint,  reporting  all  the  intervals  that  contain 
qx.  Stop  as  soon  as  an  interval  does  not  contain  qx. 

6.  QUERYlNTERVALTREE(?r(v),g.v) 

Analyzing  the  query  time  is  not  very  difficult.  At  any  node  v  that  we  visit  we 
spend  (9(1  +kv)  time,  where  kv  is  the  number  of  intervals  that  we  report  at  v. 
The  sum  of  the  kv’s  over  the  visited  nodes  is,  of  course,  k.  Furthermore,  we 
visit  at  most  one  node  at  any  depth  of  the  tree.  As  noted  above,  the  depth  of  the 
interval  tree  is  (9 (log/;).  So  the  total  query  time  is  Of  log  //  +  k). 


The  following  theorem  summarizes  the  results  about  interval  trees. 

Theorem  10.4  An  interval  tree  for  a  set  I  ofn  intervals  uses  0(n)  storage  and 
can  be  built  in  0(n log n)  time.  Using  the  interval  tree  we  can  report  all  intervals 
that  contain  a  query  point  in  O(\ogn  +  k)  time,  where  k  is  the  number  of  reported 
intervals. 
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It’s  time  to  pause  a  moment  and  see  where  all  this  has  brought  us.  The 
problem  we  originally  wanted  to  solve  is  this:  store  a  set  S  of  axis-parallel 
segments  in  a  data  structure  that  allows  us  to  find  the  segments  intersecting  a 
query  window  W  =  [x :  x'}  x  \y  :  y'] .  Finding  the  segments  that  have  an  endpoint 
inside  W  could  be  done  using  a  data  structure  from  Chapter  5,  the  range  tree. 
The  other  segments  intersecting  W  had  to  intersect  the  boundary  of  W  twice. 
We  planned  to  find  these  segments  by  querying  with  the  left  and  top  edges  of  W. 
So  we  needed  a  data  structure  that  stores  a  set  of  horizontal  segments  such  that 
the  ones  intersecting  a  vertical  query  segment  can  be  reported  efficiently,  and  a 
similar  data  structure  storing  the  vertical  segments  that  allows  for  intersection 
queries  with  horizontal  segments.  We  started  by  developing  a  data  structure  that 
solves  a  slightly  simpler  problem,  namely  where  the  query  object  is  a  full  line. 
This  led  to  the  interval  tree.  Now  let’s  see  how  to  extend  the  interval  tree  to  the 
case  where  the  query  object  is  a  vertical  line  segment. 

Let  Sh  C  S  be  the  subset  of  horizontal  segments  in  S ,  and  let  q  be  the 
vertical  query  segment  qx  x  [qy  :  q'] .  For  a  segment  s  :=  [sx  :  s'x]  x  sy  in  5//,  we 
call  [sjc  :  s',.]  the  x-interval  of  the  segment.  Suppose  we  have  stored  the  segments 
in  Sh  in  an  interval  tree  7  according  to  their  x-intervals.  Let’s  go  through  the 
query  algorithm  QueryIntervalTree  to  see  what  happens  when  we  query  T 
with  the  vertical  query  segment  q.  Suppose  qx  lies  to  the  left  of  the  A'm;d-value 
stored  at  the  root  of  the  interval  tree  7.  It  is  still  correct  that  we  only  search 
recursively  in  the  left  subtree:  segments  completely  to  the  right  of.Ymjd  cannot  be 
intersected  by  q  so  we  can  skip  the  right  subtree.  The  way  the  set  7mid  is  treated, 
however,  is  not  correct  anymore.  For  a  segment  ,v  £  /ml(|  to  be  intersected  by  q, 
it  is  not  sufficient  that  its  left  endpoint  lies  to  the  left  of  q\  it  is  also  required 
that  its  y-coordinate  lies  in  the  range  \qy  :  q[  \.  Figure  10.4  illustrates  this.  So 
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Figure  10.4 

Segments  intersected  by  q  must  have 
their  left  endpoint  in  the  shaded  region 


storing  the  endpoints  in  an  ordered  list  is  not  enough.  We  need  a  more  elaborate 
associated  structure:  given  a  query  range  (— °°  :  qx\  x  [qy  :  q'y],  we  must  be  able 
to  report  all  the  segments  whose  left  endpoint  lies  in  that  range.  If  q  lies  to 

the  right  of  .vmKi  we  want  to  report  all  segments  whose  right  endpoint  lies  in  _ 

the  range  [qx  :  +°°)  x  \qy  :  q\  | ,  so  for  this  case  we  need  a  second  associated  225 


Chapter  10  structure.  How  should  we  implement  the  associated  structure?  Well,  the  query 

more  geometric  data  structures  that  we  wish  to  perform  is  nothing  more  than  a  rectangular  range  query  on  a  set 

of  points.  A  2-dimensional  range  tree,  described  in  Chapter  5,  will  therefore  do 
the  trick.  This  way  the  associated  structure  uses  0  (timid  log  ^mid)  storage,  where 
umid  :=  card(/mjd),  and  its  query  time  is  O(\ognm^  +  k). 

The  data  structure  that  stores  the  set  Sh  of  horizontal  segments  is  now  as 
follows.  The  main  structure  is  an  interval  tree  T  on  the  x-intervals  of  the 
segments.  Instead  of  the  sorted  lists  £ieft(v)  and  £ns,|U(  vj  we  have  two  range 
trees:  a  range  tree  Tieft(v)  on  the  left  endpoints  of  the  segments  in  /mid(v), 
and  a  range  tree  Tr;ght(v)  on  the  right  endpoints  of  the  segments  in  /mid(v). 
Because  the  storage  required  for  a  range  tree  is  a  factor  log  n  larger  than  for 
sorted  lists,  the  total  amount  of  storage  for  the  data  structure  becomes  0(n\ogn). 
The  preprocessing  time  remains  O(nlogn). 

The  query  algorithm  is  the  same  as  QueryIntervalTree,  except  that, 
instead  of  walking  along  the  sorted  list  £]eft(v),  say,  we  perform  a  query  in  the 
range  tree  ,Tieft(v).  So  at  each  of  the  O(k)gii)  nodes  v  on  the  search  path  we 
spend  O(\ogn  +  kv)  time,  where  kv  is  the  number  of  reported  segments.  The 
total  query  time  therefore  becomes  O(log2n  +  k). 

We  have  proved  the  following  theorem. 

Theorem  10.5  Let  Sbea  set  of  n  horizontal  segments  in  the  plane.  The  segments 
intersecting  a  vertical  query  segment  can  be  reported  in  O(log2  n  +  k)  time  with 
a  data  structure  that  uses  0(n  log/;)  storage,  where  k  is  the  number  of  reported 
segments.  The  structure  can  be  built  in  0(n  logn)  time. 

If  we  combine  this  with  the  result  of  Lemma  10.1  we  get  a  solution  to  the 
windowing  problem  for  axis-parallel  segments. 

Corollary  10.6  Let  S  be  a  set  of  n  axis-parallel  segments  in  the  plane.  The 
segments  intersecting  an  axis-parallel  rectangular  query  window  can  be  reported 
in  Oflog2  n  +  k)  time  with  a  data  structure  that  uses  0(n  log/;)  storage,  where 
k  is  the  number  of  reported  segments.  The  structure  can  be  built  in  0(n  log  n) 
time. 


10.2  Priority  Search  Trees 

In  the  structure  for  windowing  described  in  Section  10.1  we  used  a  range  tree  for 
the  associated  structures.  The  range  queries  we  perform  on  them  have  a  special 
property:  they  are  unbounded  on  one  side.  In  this  section  we  will  describe 
a  different  data  structure,  the  priority  search  tree,  that  uses  this  property  to 
improve  the  bound  on  storage  to  O(n).  This  data  structure  is  also  a  lot  simpler 
because  it  does  not  require  fractional  cascading.  Using  priority  search  trees 
instead  of  range  trees  in  the  data  structure  for  windowing  reduces  the  storage 
bound  in  Theorem  10.5  to  0{n).  It  does  not  improve  the  storage  bound  in 


226 


Corollary  10.6  because  there  we  also  need  a  range  tree  to  report  the  endpoints  Section  10.2 

that  lie  in  the  window.  priority  search  trees 

Let  P  :=  {p\,P2,  ■  ■  ■  iPn}  be  a  set  of  points  in  the  plane.  We  want  to  design 
a  structure  for  rectangular  range  queries  of  the  form  (— °°  :  qx]  x  [qy  :  q'y\.  To 
get  some  idea  of  how  this  special  property  can  be  used,  let’s  look  at  the  1- 
dimensional  case.  A  normal  1 -dimensional  range  query  would  ask  for  the  points 
lying  in  a  range  [q'x  :  qx].  To  find  these  points  efficiently  we  can  store  the  set  of 
points  in  a  1 -dimensional  range  tree,  as  described  in  Chapter  5.  If  the  range  is 
unbounded  to  the  left,  we  are  asking  for  the  points  lying  in  (— °°  :  qx\.  This  can 
be  solved  by  simply  walking  along  an  ordered  list  starting  at  the  leftmost  point, 
until  we  encounter  a  point  that  is  not  in  the  range.  The  query  time  is  (9(1  +k), 
instead  of  (9(log n  +  k)  which  we  needed  in  the  general  case. 

How  can  we  extend  this  strategy  to  2-dimensional  range  queries  that  are 
unbounded  to  the  left?  Somehow  we  must  integrate  information  about  the 
y-coordinate  in  the  structure  without  using  associated  structures,  so  that,  among 
the  points  whose  v-coordinate  is  in  (— °°  :  qx],  we  can  easily  select  the  ones 
whose  y-coordinate  is  in  [qy  :  q'y],  A  simple  linear  list  doesn’t  lend  itself  well 
for  this.  Therefore  we  take  a  different  structure  to  work  with:  the  heap. 


Figure  10.5 
A  heap  for  the  set 
{1,3,4,8,11,15,21,22,36} 


A  heap  is  normally  used  for  priority  queries  that  ask  for  the  smallest  (or  largest) 
value  in  a  set.  But  heaps  can  also  be  used  to  answer  1-dimensional  range  queries 
of  the  form  (— °°  :  qx}.  A  heap  has  the  same  query  time  as  a  sorted  list,  namely 
0(1  +  k).  Normally  the  advantage  of  a  heap  over  a  sorted  list  is  that  points  can 
be  inserted  and  the  maximum  deleted  more  efficiently.  For  us  the  tree  structure 
of  a  heap  has  another  advantage:  it  makes  it  easier  to  integrate  information 
about  the  y-coordinate,  as  we  shall  see  shortly.  A  heap  is  a  binary  tree  defined 
as  follows.  The  root  of  the  tree  stores  the  point  with  minimum  a- value.  The 
remainder  of  the  set  is  partitioned  into  two  subsets  of  almost  equal  size,  and 
these  subsets  are  stored  recursively  in  the  same  way.  Figure  10.5  gives  an 
example  of  a  heap.  We  can  do  a  query  with  (— °°  :  qx]  by  walking  down  the  tree. 
When  we  visit  a  node  we  check  if  the  v-coordinate  of  the  point  stored  at  the 
node  lies  in  (— °°  :  qx\.  If  it  does,  we  report  the  point  and  continue  the  search 
in  both  subtrees;  otherwise,  we  abort  the  search  in  this  part  of  the  tree.  For 
example,  when  we  search  with  (— °°  :  5]  in  the  tree  of  Figure  10.5,  we  report  the 
points  1.  3,  and  4.  We  also  visit  the  nodes  with  8  and  1 1  but  abort  the  search 
there. 
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Figure  10.6 

A  set  of  points  and  the  corresponding 
priority  search  tree 


Heaps  give  us  some  freedom  in  how  to  partition  the  set  into  two  subsets. 
If  we  also  want  to  search  on  y-coordinate  then  the  trick  is  to  perform  the 
partitioning  not  in  an  arbitrary  way,  as  is  the  case  for  normal  heaps,  but  according 
to  y-coordinate.  More  precisely,  we  split  the  remainder  of  the  set  into  two  subsets 
of  almost  equal  size  such  that  the  y-coordinate  of  any  point  in  one  subset  is 
smaller  than  the  y-coordinate  of  any  point  in  the  other  subset.  This  is  illustrated 
in  Figure  10.6.  The  tree  is  drawn  sideways  to  indicate  that  the  partitioning 
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is  on  y-coordinate.  In  the  example  of  Figure  10.6  the  point  p$  has  smallest 
x-cccrdinate  and,  hence,  is  stored  in  the  root.  The  other  points  are  partitioned 
in  y-coordinate.  The  points  py,  /;4,  and  /y,  have  smaller  y-coordinate  and  are 
stored  in  the  left  subtree.  Of  these  py  has  smallest  x-coordinate,  so  this  point  is 
placed  in  the  root  of  the  subtree,  and  so  on. 

A  formal  definition  of  a  priority  search  tree  for  a  set  P  of  points  is  as  follows. 
We  assume  that  all  the  points  have  distinct  coordinates.  In  Chapter  5  (more 
precisely,  in  Section  5.5)  we  saw  that  this  involves  no  loss  of  generality;  by 
using  composite  numbers  we  can  simulate  that  all  coordinates  are  distinct. 

■  If  P  —  0  then  the  priority  search  tree  is  an  empty  leaf. 

■  Otherwise,  let  /;mm  be  the  point  in  the  set  P  with  the  smallest  x-coordinate. 
Let  ymi(j  be  the  median  of  the  y-coordinates  of  the  remaining  points.  Let 

Fbvlow  •  { P  F  P  \  { p m i n  }  ■  Py  A  ymid} • 

f  above  -  =  { P  €  P  \  { p m i n  }  ■  Py  >  ymid}- 

The  priority  search  tree  consists  of  a  root  node  v  where  the  point  p(v)  := 
pmin  and  the  value  y(v)  :=  ymjd  are  stored.  Furthermore, 

■  the  left  subtree  of  v  is  a  priority  search  tree  for  the  set  Pbciowi 

■  the  right  subtree  of  v  is  a  priority  search  tree  for  the  set  Pabove- 

It’s  straightforward  to  derive  a  recursive  O(n\ogn)  algorithm  for  building  a 
priority  search  tree.  Interestingly,  priority  search  trees  can  even  be  built  in  linear 
time,  if  the  points  are  already  sorted  on  y-coordinate.  The  idea  is  to  construct 
the  tree  bottom-up  instead  of  top-down,  in  the  same  way  heaps  are  normally 
constructed. 

A  query  with  a  range  (— °°  :  qx\  x  [qy  :  q'x]  in  a  priority  search  tree  is  performed 
roughly  as  follows.  We  search  with  qy  and  q'  as  indicated  in  Figure  10.7.  All 
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the  shaded  subtrees  in  the  figure  store  only  points  whose  y-coordinate  lies  in  the 
correct  range.  So  we  can  search  those  subtrees  based  on  x-coordinate  only.  This 
is  done  with  the  following  subroutine,  which  is  basically  the  query  algorithm 
for  a  heap. 


ReportInSubtree(v,  qx) 

Input.  The  root  v  of  a  subtree  of  a  priority  search  tree  and  a  value  qx. 

Output.  All  points  in  the  subtree  with  x-coordinate  at  most  qx. 

1.  if  V  is  not  a  leaf  and  (p(v))x  f  qx 

2.  then  Report  p(v). 

3.  ReportInSubtree(/c(v),^.v) 

4.  REPORTlNSUBTREE(rc(v),gA) 

Lemma  10.7  ReportInSubtree(  v,^a)  reports  in  (9(1  +kv)  time  all  points  in 
the  subtree  rooted  at  v  whose  x-coordinate  is  at  most  qx,  where  kv  is  the  number 
of  reported  points. 

Proof.  Let  p(p)  be  a  point  with  (p(p))x  f  qx  that  is  stored  at  a  node  p  in  the 
subtree  rooted  at  V.  By  definition  of  the  data  structure,  the  x-coordinates  of  the 
points  stored  at  the  path  from  p  to  v  form  a  decreasing  sequence,  so  all  these 
points  must  have  x-coordinates  at  most  qx.  Hence,  the  search  is  not  aborted  at 
any  of  these  nodes,  which  implies  that  p  is  reached  and  p(p)  is  reported.  We 
conclude  that  all  points  with  x-coordinate  at  most  qx  are  reported.  Obviously, 
those  are  the  only  reported  points. 

At  any  node  p  that  we  visit  we  spend  (9(1)  time.  When  we  visit  a  node  p 
with  ]U  /  v  we  must  have  reported  a  point  at  the  parent  of  p .  We  charge  the 
time  we  spend  at  p  to  this  point.  This  way  any  reported  point  gets  charged 
twice,  which  means  that  the  time  we  spend  at  nodes  p  with  p  /  V  is  0(kv). 
Adding  the  time  we  spend  at  v  we  get  a  total  of  (9(  1  +  kv)  time.  0 

If  we  call  ReportInSubtree  at  each  of  the  subtrees  that  we  select  (the  shaded 
subtrees  of  Figure  10.7),  do  we  find  all  the  points  that  lie  in  the  query  range?  The 
answer  is  no.  The  root  of  the  tree,  for  example,  stores  the  point  with  smallest 
x-coordinate.  This  may  very  well  be  a  point  in  the  query  range.  In  fact,  any 
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Figure  10.7 

Querying  a  priority  search  tree 
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Chapter  10  point  stored  at  a  node  on  the  search  path  to  qy  or  q'y  may  lie  in  the  query  range, 
more  geometric  data  structures  so  we  should  test  them  as  well.  This  leads  to  the  following  query  algorithm. 

Algorithm  QueryPrioSearchTree(T,  (—00  ;  qx]  x  \qy  :  q\] ) 

Input.  A  priority  search  tree  and  a  range,  unbounded  to  the  left. 

Output.  All  points  lying  in  the  range. 

1 .  Search  with  qy  and  q'y  in  T.  Let  vspijt  be  the  node  where  the  two  search 
paths  split. 

2.  for  each  node  v  on  the  search  path  of  qy  or  q[, 

3.  do  if  p(v)  €  (  — 00  qx\  X  Vly  :  q'y]  then  report  p(v). 

4.  for  each  node  v  on  the  path  of  qy  in  the  left  subtree  of  vspiit 

5.  do  if  the  search  path  goes  left  at  v 

6.  then  REPORTlNSUBTREE(rc(v),g.v) 

7.  for  each  node  v  on  the  path  of  q[,  in  the  right  subtree  of  vspijt 

8.  do  if  the  search  path  goes  right  at  v 

9.  then  ReportInSubtree(/c(v),^.v) 

Lemma  10.8  Algorithm  QueryPrioSearchTree  reports  the  points  in  a 
query  range  (— °°  :  qx\  x  [qy  :  q'y]  in  O(\og  n  +  k)  time,  where  k  is  the  number  of 
reported  points. 

Proof.  First  we  prove  that  any  point  that  is  reported  by  the  algorithm  lies  in 
the  query  range.  For  the  points  on  the  search  paths  to  qy  and  q'y  this  is  obvious: 
these  points  are  tested  explicitly  for  containment  in  the  range.  Consider  a  call 
ReportInSubtree(cc(  v) ,  qx)  in  line  6.  Let  p  be  a  point  that  is  reported  in  this 
call.  By  Lemma  10.7  we  have  px  f  qx.  Furthermore,  py  f  q'  because  all  the 
nodes  visited  in  this  call  lie  to  the  left  of  vspiit  and  q'y  >  y(vspiit).  Finally,  py  ^  qy, 
because  all  the  nodes  visited  in  this  call  lie  to  the  right  of  v  and  the  search  path 
to  qy  went  left  at  V.  A  similar  argument  applies  for  the  points  reported  in  line  9. 

We  have  proved  that  all  the  reported  points  lie  in  the  query  range.  Conversely, 
let  p(p)  be  a  point  in  the  range.  Any  point  stored  to  the  left  of  a  node  where  the 
search  path  to  qy  goes  right  must  have  a  y-coordinate  smaller  than  qy.  Similarly, 
any  point  stored  to  the  right  of  a  node  where  the  search  path  to  q'y  goes  left 
must  have  a  v-coordinate  greater  than  q'y.  Hence,  p  must  either  be  on  one  of  the 
search  paths,  or  in  one  of  the  subtrees  for  which  ReportInSubtree  is  called. 
In  both  cases  p(p)  will  be  reported. 

It  remains  to  analyze  the  time  taken  by  the  algorithm.  This  is  linear  in  the 
number  of  nodes  on  the  search  paths  to  qy  and  q[,  plus  the  time  taken  by  all  the 
executions  of  the  procedure  ReportInSubtree.  The  depth  of  the  tree  and. 
hence,  the  number  of  nodes  on  the  search  paths,  is  O(\ogn).  The  time  taken  by 
all  executions  of  ReportInSubtree  is  O(\ogn  +  k)  by  Lemma  10.7.  0 

The  performance  of  priority  search  trees  is  summarized  in  the  following  theorem. 

Theorem  10.9  A  priority  search  tree  for  a  set  P  of  n  points  in  the  plane  uses 
O(n)  storage  and  can  be  built  in  0(n  log/;)  time.  Using  the  priority  search  tree 
we  can  report  all  points  in  a  query  range  of  the  form  (— °°  ;  qf  x  [qy  :  q[]  in 
Odogn  +  k)  time,  where  k  is  the  number  of  reported  points. 
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10.3  Segment  Trees 

So  far  we  have  studied  the  windowing  problem  for  a  set  of  axis-parallel  line 
segments.  We  developed  a  nice  data  structure  for  this  problem  using  interval 
trees  with  priority  search  trees  as  associated  structure.  The  restriction  to  axis- 
parallel  segments  was  inspired  by  the  application  to  printed  circuit  board  design. 
When  we  are  doing  windowing  queries  in  roadmaps,  however,  we  must  drop 
this  restriction:  roadmaps  contain  line  segments  at  arbitrary  orientation. 

There  is  a  trick  that  we  can  use  to  reduce  the  general  problem  to  a  problem 
on  axis-parallel  segments.  We  can  replace  each  segment  by  its  bounding  box. 
Using  the  data  structure  for  axis-parallel  segments  that  we  developed  earlier, 
we  can  now  find  all  the  bounding  boxes  that  intersect  the  query  window  W. 
We  then  check  every  segment  whose  bounding  box  intersects  W  to  see  if  the 
segment  itself  also  intersects  W.  In  practice  this  technique  usually  works  quite 
well:  the  majority  of  the  segments  whose  bounding  box  intersects  W  will  also 
intersect  W  themselves.  In  the  worst  case,  however,  the  solution  is  quite  bad: 
all  bounding  boxes  may  intersect  W  whereas  none  of  the  segments  do.  So  if  we 
want  to  guarantee  a  fast  query  time,  we  must  look  for  another  method. 

As  before  we  make  a  distinction  between  segments  that  have  an  endpoint  in 
the  window  and  segments  that  intersect  the  window  boundary.  The  first  type  of 
segments  can  be  reported  using  a  range  tree.  To  find  the  answers  of  the  second 
type  we  perform  an  intersection  query  with  each  of  the  four  boundary  edges 
of  the  window.  (Of  course  care  has  to  be  taken  that  answers  are  reported  only 
once.)  We  will  only  show  how  to  perform  queries  with  vertical  boundary  edges. 
For  the  horizontal  edges  a  similar  approach  can  be  used.  So  we  are  given  a  set  S 
of  line  segments  with  arbitrary  orientations  in  the  plane,  and  we  want  to  find 
those  segments  in  S  that  intersect  a  vertical  query  segment  q:=qxx  [qy :  q'y].  We 
will  assume  that  the  segments  in  S  don’t  intersect  each  other,  but  we  allow  them 
to  touch.  (For  intersecting  segments  the  problem  is  a  lot  harder  to  solve  and  the 
time  bounds  are  worse.  Techniques  like  the  ones  described  in  Chapter  16  are 
required  in  this  case.) 

Let’s  first  see  if  we  can  adapt  the  solution  of  the  previous  sections  to  the 
case  of  arbitrarily  oriented  segments.  By  searching  with  qx  in  the  interval  tree 
we  select  a  number  of  subsets  /mid(v).  For  a  selected  node  v  with  .Ym;d(v)  >  qx, 
the  right  endpoint  of  any  segment  in  /mid(v)  lies  to  the  right  of  q.  If  the  segment 
is  horizontal,  then  it  is  intersected  by  the  query  segment  if  and  only  if  its  left 
endpoint  lies  in  the  range  (— °°  :  qx\  x  \qy  :  q\\ .  If  the  segments  have  arbitrary 
orientation,  however,  things  are  not  so  simple:  knowing  that  the  right  endpoint  of 
a  segment  is  to  the  right  of  q  doesn’t  help  us  much.  The  interval  tree  is  therefore 
not  very  useful  in  this  case.  Let’s  try  to  design  a  different  data  structure  for 
the  1 -dimensional  problem,  one  that  is  more  suited  for  dealing  with  arbitrarily 
oriented  segments. 
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One  of  the  paradigms  for  developing  data  structures  is  the  locus  approach.  A 
query  is  described  by  a  number  of  parameters;  for  the  windowing  problem, 
for  example,  there  are  four  parameters,  namely  qx,  q[,  qy.  and  q\.  For  each 
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choice  of  the  parameters  we  get  a  certain  answer.  Often  nearby  choices  give 
the  same  answer;  if  we  move  the  window  slightly  it  will  often  still  intersect 
the  same  collection  of  segments.  Let  the  parameter  space  be  the  space  of  all 
possible  choices  for  the  parameters.  For  the  windowing  problem  this  space  is 
4-dimensional.  The  locus  approach  suggests  partitioning  the  parameter  space 
into  regions  such  that  queries  in  the  same  region  have  the  same  answer.  Hence, 
if  we  locate  the  region  that  contains  the  query  then  we  know  the  answer  to  it. 
Such  an  approach  only  works  well  when  the  number  of  regions  is  small.  For  the 
windowing  problem  this  is  not  true.  There  can  be  ©(«4)  different  regions.  But 
we  can  use  the  locus  approach  to  create  an  alternative  for  the  interval  tree. 

Let  I  :=  {[xi  :  xyj],  [x2  :  x?2],  ■  ■  ■ ,  [xn  :  x'n])  be  a  set  of  n  intervals  on  the  real 
line.  The  data  structure  that  we  are  looking  for  should  be  able  to  report  the 
intervals  containing  a  query  point  qx.  Our  query  has  only  one  parameter,  qx, 
so  the  parameter  space  is  the  real  line.  Let  p\,p2,...  ■  pm  be  the  list  of  distinct 
interval  endpoints,  sorted  from  left  to  right.  The  partitioning  of  the  parameter 
space  is  simply  the  partitioning  of  the  real  line  induced  by  the  points  p,-.  We 
call  the  regions  in  this  partitioning  elementary  intervals.  Thus  the  elementary 
intervals  are,  from  left  to  right, 

(- °°-Pi),\pi  -Pi],(j>i  -P2),[P2 


{Pm—  1  •  Pm)i  [ Pm  •  Pm\->  ( Pm  •  “h00)- 

The  list  of  elementary  intervals  consists  of  open  intervals  between  two  consecu¬ 
tive  endpoints  p,  and  pl+  \ ,  alternated  with  closed  intervals  consisting  of  a  single 
endpoint.  The  reason  that  we  treat  the  points  p,  themselves  as  intervals  is,  of 
course,  that  the  answer  to  a  query  is  not  necessarily  the  same  at  the  interior  of 
an  elementary  interval  and  at  its  endpoints. 

To  find  the  intervals  that  contain  a  query  point  qx,  we  must  determine  the 
elementary  interval  that  contains  qx.  To  this  end  we  build  a  binary  search  tree  T 
whose  leaves  correspond  to  the  elementary  intervals.  We  denote  the  elementary 
interval  corresponding  to  a  leaf  ix  by  Int(/x ). 

If  all  the  intervals  in  I  containing  Int(/x)  are  stored  at  the  leaf  /x,  then  we  can 
report  the  k  intervals  containing  qx  in  (9(log n  +  k)  time:  first  search  in  0(log  n) 
time  with  qx  in  T,  and  then  report  all  the  intervals  stored  at  Ll  in  ()( 1  +  k)  time. 
So  queries  can  be  answered  efficiently.  But  what  about  the  storage  requirement 
of  the  data  structure?  Intervals  that  span  a  lot  of  elementary  intervals  are  stored 
at  many  leaves  in  the  data  structure.  Hence,  the  amount  of  storage  will  be  high 
if  there  are  many  pairs  of  overlapping  intervals.  If  we  have  bad  luck  the  amount 
of  storage  can  even  become  quadratic.  Let’s  see  if  we  can  do  something  to 
reduce  the  amount  of  storage.  In  Figure  10.8  you  see  an  interval  that  spans  five 
elementary  intervals.  Consider  the  elementary  intervals  corresponding  to  the 
leaves  p.\,  p.2~  1*3,  and  AH-  When  the  search  path  to  qx  ends  in  one  of  those  leaves 
we  must  report  the  interval.  The  crucial  observation  is  that  a  search  path  ends  in 
/ij ,  /i2,  ,1X3 ,  or  1X4  if  and  only  if  the  path  passes  through  the  internal  node  v.  So 
why  not  store  the  interval  at  node  v  (and  at  /X5)  instead  of  at  the  leaves  Ll\ ,  /io, 
/X3,  and  t/4  (and  at  /X5)?  In  general,  we  store  an  interval  at  a  number  of  nodes 
that  together  cover  the  interval,  and  we  choose  these  nodes  as  high  as  possible. 
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Figure  10.8 

The  segment  s  is  stored  at  v  instead  of 
at  jl  \ ,  fAo,  H3,  and  gt 4 


The  data  structure  based  on  this  principle  is  called  a  segment  tree.  We  now 
describe  the  segment  tree  for  a  set  I  of  intervals  more  precisely.  Figure  10.9 
shows  a  segment  tree  for  a  set  of  five  intervals. 


Figure  10.9 

A  segment  tree:  the  arrows  from  the 
nodes  point  to  their  canonical  subsets 


■  The  skeleton  of  the  segment  tree  is  a  balanced  binary  tree  T.  The  leaves  of 
T  correspond  to  the  elementary  intervals  induced  by  the  endpoints  of  the 
intervals  in  I  in  an  ordered  way:  the  leftmost  leaf  corresponds  to  the  leftmost 
elementary  interval,  and  so  on.  The  elementary  interval  corresponding  to 
leaf  /t  is  denoted  Int(/t). 

■  The  internal  nodes  of  7  correspond  to  intervals  that  are  the  union  of  ele¬ 
mentary  intervals:  the  interval  Int(v)  corresponding  to  node  v  is  the  union 
of  the  elementary  intervals  Int(jtl)  of  the  leaves  in  the  subtree  rooted  at  v. 

(This  implies  that  Int(v)  is  the  union  of  the  intervals  of  its  two  children.) 

■  Each  node  or  leaf  v  in  T  stores  the  interval  Int(v)  and  a  set  /(v)  Cl  of  inter¬ 
vals  (for  example,  in  a  linked  list).  This  canonical  subset  of  node  v  contains 
the  intervals  [x :  x!\  €  /  such  that  Int(v)  C  [x :  x']  and  lnt(parent(v))  %  [x :  x']. 

Let’s  see  if  our  strategy  of  storing  intervals  as  high  as  possible  has  helped  to  _ 

reduce  the  amount  of  storage.  233 


Lemma  10.10  A  segment  tree  on  a  set  of  n  intervals  uses  O(nlogn)  storage. 

Proof.  Because  7  is  a  balanced  binary  search  tree  with  at  most  4 n  +  1  leaves,  its 
height  is  O(\ogn).  We  claim  that  any  interval  [x :  x'}  €  /  is  stored  in  the  set  I(v) 
for  at  most  two  nodes  at  the  same  depth  of  the  tree.  To  see  why  this  is  true,  let 
Vi,V2,  V3  be  three  nodes  at  the  same  depth,  numbered  from  left  to  right.  Suppose 
[. x  :  x']  is  stored  at  Vi  and  V3.  This  means  that  [. x  :  x!\  spans  the  whole  interval 
from  the  left  endpoint  of  Int(vi)  to  the  right  endpoint  of  Int(v3).  Because  v2 
lies  between  V|  and  V3,  Int (parent(yf))  must  be  contained  in  [x  :  x'].  Hence, 
[. x :  x!\  will  not  be  stored  at  v2.  It  follows  that  any  interval  is  stored  at  most  twice 
at  a  given  depth  of  the  tree,  so  the  total  amount  of  storage  is  O(n\ogn).  0 

So  the  strategy  has  helped:  we  have  reduced  the  worst-case  amount  of 
storage  from  quadratic  to  0(n  log/;).  But  what  about  queries:  can  they  still  be 
answered  easily?  The  answer  is  yes.  The  following  simple  algorithm  describes 
how  this  is  done.  It  is  first  called  with  v  =  root{7). 

Algorithm  QuerySegmentTree(v,^) 

Input.  The  root  of  a  (subtree  of  a)  segment  tree  and  a  query  point  qx. 

Output.  All  intervals  in  the  tree  containing  qx. 

1.  Report  all  the  intervals  in  7(v). 

2.  if  v  is  not  a  leaf 

3.  then  if  qx  e  Int(/c(v)) 

4.  then  QuerySegmentTree(/c(v),^.v) 

5.  else  QUERYS  EGMENTTREE(rc( v ) ,  qx) 

The  query  algorithm  visits  one  node  per  level  of  the  tree,  so  O(\ogn)  nodes  in 
total.  At  a  node  v  we  spend  0(  1  +  kv)  time,  where  kv  is  the  number  of  reported 
intervals.  This  leads  to  the  following  lemma. 

Lemma  10.11  Using  a  segment  tree,  the  intervals  containing  a  query  point  qx 
can  be  reported  in  O(\ogn  +  k)  time,  where  k  is  the  number  of  reported  intervals. 

To  construct  a  segment  tree  we  proceed  as  follows.  First  we  sort  the  end¬ 
points  of  the  intervals  in  I  in  O(nlogn)  time.  This  gives  us  the  elementary 
intervals.  We  then  construct  a  balanced  binary  tree  on  the  elementary  intervals, 
and  we  determine  for  each  node  v  of  the  tree  the  interval  Int(v)  it  represents. 
This  can  be  done  bottom-up  in  linear  time.  It  remains  to  compute  the  canonical 
subsets  for  the  nodes.  To  this  end  we  insert  the  intervals  one  by  one  into  the 
segment  tree.  An  interval  is  inserted  into  Tby  calling  the  following  procedure 
with  v  =  rootfj). 

Algorithm  InsertSegmentTree(v,  [x  :  x!\) 

Input.  The  root  of  a  (subtree  of  a)  segment  tree  and  an  interval. 

Output.  The  interval  will  be  stored  in  the  subtree. 

1.  if  Int(v)  C  [x :  x'] 

2.  then  store  [x  :  xJ]  at  V 

3.  else  if  Int(/c(v))  n  [x :  x']  f  0 

234  4.  then  InsertSegmentTree(/c(v),  [x :  x']) 
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How  much  time  does  it  take  to  insert  an  interval  [x  :  x']  into  the  segment  tree? 

At  every  node  that  we  visit  we  spend  constant  time  (assuming  we  store  7(v)  in 
a  simple  structure  like  a  linked  list).  When  we  visit  a  node  v,  we  either  store 
[x  :  x']  at  v,  or  Int(v)  contains  an  endpoint  of  [. x  :  x'].  We  have  already  seen 
that  an  interval  is  stored  at  most  twice  at  each  level  of  T.  There  is  also  at  most 
one  node  at  every  level  whose  corresponding  interval  contains  x  and  one  node 
whose  interval  contains  x' .  So  we  visit  at  most  4  nodes  per  level.  Hence,  the 
time  to  insert  a  single  interval  is  O(\ogn),  and  the  total  time  to  construct  the 
segment  tree  is  O(nlogn). 

The  performance  of  segment  trees  is  summarized  in  the  following  theorem. 

Theorem  10.12  A  segment  tree  for  a  sell  ofn  intervals  uses  O(nlogn)  storage 
and  can  be  built  in  0(n  \ogn)  time.  Using  the  segment  tree  we  can  report  all 
intervals  that  contain  a  query  point  in  0( log  n  +  k)  time,  where  k  is  the  number 
of  reported  intervals. 

Recall  that  an  interval  tree  uses  only  linear  storage,  and  that  it  also  allows 
us  to  report  the  intervals  containing  a  query  point  in  O(\ogn  +  k)  time.  So  for 
this  task  an  interval  tree  is  to  be  preferred  to  a  segment  tree.  When  we  want  to 
answer  more  complicated  queries,  such  as  windowing  in  a  set  of  line  segments, 
then  a  segment  tree  is  a  more  powerful  structure  to  work  with.  The  reason  is  that 
the  set  of  intervals  containing  qx  is  exactly  the  union  of  the  canonical  subsets 
that  we  select  when  we  search  in  the  segment  tree.  In  an  interval  tree,  on  the 
other  hand,  we  also  select  O(\ogn)  nodes  during  a  query,  but  not  all  intervals 
stored  at  those  nodes  contain  the  query  point.  We  still  have  to  walk  along  the 
sorted  list  to  find  the  intersected  intervals.  So  for  segment  trees,  we  have  the 
possibility  of  storing  the  canonical  subsets  in  an  associated  structure  that  allows 
for  further  querying. 

Let’s  go  back  to  the  windowing  problem.  Let  S  be  a  set  of  arbitrarily  oriented, 
disjoint  segments  in  the  plane.  We  want  to  report  the  segments  intersecting  a 
vertical  query  segment  q  :=  qx  x  [qy  :  q[] .  Let’s  see  what  happens  when  we  build 
a  segment  tree  T  on  the  x-intervals  of  the  segments  in  S.  A  node  v  in  T  can  now 
be  considered  to  correspond  to  the  vertical  slab  Int(v)  x  (— °°  :  +<=°).  A  segment 
is  in  the  canonical  subset  of  v  if  it  completely  crosses  the  slab  corresponding 
to  v — we  say  that  the  segment  spans  the  slab — but  not  the  slab  corresponding 
to  the  parent  of  v.  We  denote  these  subsets  with  S(v).  See  Figure  10.10  for  an 
illustration. 

When  we  search  with  qx  in  T  we  find  O(\ogn)  canonical  subsets — those  of 
the  nodes  on  the  search  path — that  collectively  contain  all  the  segments  whose 
x-interval  contains  qx.  A  segment  ,v  in  such  a  canonical  subset  is  intersected 
by  q  if  and  only  if  the  lower  endpoint  of  q  is  below  ,v  and  the  upper  endpoint 
of  q  is  above  s.  How  do  we  find  the  segments  between  the  endpoints  of  ql 
Here  we  use  the  fact  that  the  segments  in  the  canonical  subset  .S’ (  V  j  span  the 


5.  if  Int(rc(v))  IT  [x  :  x']  ^  0 

6.  then  InsertSegmentTreeO-c(v),  [x  :  x7]) 
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Figure  10.10 

Canonical  subsets  contain  segments  that 
span  the  slab  of  a  node,  but  not  the  slab 
of  its  parent 


slab  corresponding  to  v  and  that  they  do  not  intersect  each  other.  This  implies 
that  the  segments  can  be  ordered  vertically.  Hence,  we  can  store  S(v)  in  a 
search  tree  T(v)  according  to  the  vertical  order.  By  searching  in  T(v)  we  can 
find  the  intersected  segments  in  O(logn  +  kv)  time,  where  kv  is  the  number  of 
intersected  segments.  The  total  data  structure  for  the  set  S  is  thus  as  follows. 

■  The  set  S  is  stored  in  a  segment  tree  T  based  on  the  x-intervals  of  the 
segments. 

■  The  canonical  subset  of  a  node  v  in  T,  which  contains  the  segments  spanning 
the  slab  corresponding  to  v  but  not  the  slab  corresponding  to  the  parent  of 
V,  is  stored  in  a  binary  search  tree  T(v)  based  on  the  vertical  order  of  the 
segments  within  the  slab. 

Because  the  associated  structure  of  any  node  v  uses  storage  linear  in  the  size  of 
S(v),  the  total  amount  of  storage  remains  O(nlogn).  The  associated  structures 
can  be  built  in  0(nlogn)  time,  leading  to  a  preprocessing  time  of  0(n  log2  n). 
With  a  bit  of  extra  work  this  can  be  improved  to  0(n\ogn).  The  idea  is  to 
maintain  a  (partial)  vertical  order  on  the  segments  while  building  the  segment 
tree.  With  this  order  available  the  associated  structures  can  be  computed  in 
linear  time. 


The  query  algorithm  is  quite  simple:  we  search  with  qx  in  the  segment  tree  in 
the  usual  way,  and  at  every  node  v  on  the  search  path  we  search  with  the  upper 
and  lower  endpoint  of  q  in  T(v)  to  report  the  segments  in  S(v)  intersected  by  q. 
This  basically  is  a  1 -dimensional  range  query — see  Section  5.1.  The  search 
in  T(v)  takes  0(log«  +  kv)  time,  where  kv  is  the  number  of  reported  segments 

_  at  v.  Hence,  the  total  query  time  is  0( log2  n  +  k),  and  we  obtain  the  following 
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Theorem  10.13  Let  Sbe  a  set  of  n  disjoint  segments  in  the  plane.  The  segments  Section  10.4 

intersecting  a  vertical  query  segment  can  be  reported  in  0(log2  n  +  k )  time  with  notes  and  comments 

a  data  structure  that  uses  0(n  log/;)  storage,  where  k  is  the  number  of  reported 
segments.  The  structure  can  be  built  in  O(nlogn)  time. 

Actually,  it  is  only  required  that  the  segments  have  disjoint  interiors.  It 
is  easily  verified  that  the  same  approach  can  be  used  when  the  endpoints  of 
segments  are  allowed  to  coincide  with  other  endpoints  or  segments.  This  leads 
to  the  following  result. 

Corollary  10.14  Let  S  be  a  set  of  n  segments  in  the  plane  with  disjoint  interiors. 

The  segments  intersecting  an  axis-parallel  rectangular  query  window  can  be 
reported  in  O(log2n  +  k)  time  with  a  data  structure  that  uses  0(n  log  n)  storage, 
where  k  is  the  number  of  reported  segments.  The  structure  can  be  built  in 
O(nlogn)  time. 


10.4  Notes  and  Comments 

The  query  that  asks  for  all  intervals  that  contain  a  given  point  is  often  referred 

to  as  a  stabbing  query.  The  interval  tree  structure  for  stabbing  queries  is  due  to 

Edelsbrunner  [157]  and  McCreight  [270].  The  priority  search  tree  was  designed  • 

by  McCreight  [271].  He  observed  that  the  priority  search  tree  can  be  used  for  ( a '**) 

stabbing  queries  as  well.  The  transformation  is  simple:  map  each  interval  [a  :  b] 

to  the  point  (a,  b)  in  the  plane.  Performing  a  stabbing  query  with  a  value  qx  can 

be  done  by  doing  a  query  with  the  range  (— °°  :  qx]  x  [qx  :  +°°).  Ranges  of  this  - • 

type  are  a  special  case  of  the  ones  supported  by  priority  search  trees. 

The  segment  tree  was  discovered  by  Bentley  [45].  Used  as  a  1 -dimensional 
data  structure  for  stabbing  queries  it  is  less  efficient  than  the  interval  tree 
since  it  requires  O(nlogn)  storage.  The  importance  of  the  segment  tree  is 
mainly  that  the  sets  of  intervals  stored  with  the  nodes  can  be  structured  in 
any  manner  convenient  for  the  problem  at  hand.  Therefore,  there  are  many 
extensions  of  segment  trees  that  deal  with  2-  and  higher-dimensional  objects 
[103,  157,  163,  301,  375].  A  second  plus  of  the  segment  tree  over  the  interval 
tree  is  that  the  segment  tree  can  easily  be  adapted  to  stabbing  counting  queries: 
report  the  number  of  intervals  containing  the  query  point.  Instead  of  storing  the 
intervals  in  a  list  with  the  nodes,  we  store  an  integer  representing  their  number. 

A  query  with  a  point  is  answered  by  adding  the  integers  on  one  search  path. 

Such  a  segment  tree  for  stabbing  counting  queries  uses  only  linear  storage  and 
queries  require  O(\ogn)  time,  so  it  is  optimal. 

There  has  been  a  lot  of  research  in  the  past  on  the  dynamization  of  interval 
trees  and  segment  trees,  that  is,  making  it  possible  to  insert  and/or  delete 
intervals.  Priority  search  trees  were  initially  described  as  a  fully  dynamic 
data  structure,  by  replacing  the  binary  tree  with  a  red-black  tree  [199,  137]  or 
other  balanced  binary  search  tree  that  requires  only  0(1)  rotations  per  update. 

Dynamization  is  important  in  situations  where  the  input  changes.  Dynamic  data  _ 

structures  are  also  important  in  many  plane  sweep  algorithms  where  the  status  237 


Chapter  10  normally  needs  to  be  stored  in  a  dynamic  structure.  In  a  number  of  problems  a 
more  geometric  data  structures  dynamic  version  of  the  segment  tree  is  required  here. 

The  notion  of  decomposable  searching  problems  [46, 48, 166, 254, 269,  276, 
304,  306,  307,  308,  337]  gave  a  great  push  toward  the  dynamization  of  a  large 
class  of  data  structures.  Let  S  be  a  set  of  objects  involved  in  a  searching  problem, 
and  let  A  U  B  be  any  partition  of  S.  The  searching  problem  is  decomposable  if  the 
solution  to  the  searching  problem  on  S  can  be  obtained  in  constant  time  from  the 
solutions  to  the  searching  problems  on  A  and  on  B.  The  stabbing  query  problem 
is  decomposable  because  if  we  have  reported  the  stabbed  intervals  in  the  two 
subsets  of  a  partition,  we  have  automatically  reported  all  intervals  in  the  whole 
set.  Similarly,  the  stabbing  counting  query  problem  is  decomposable  because 
we  can  add  the  integer  answers  yielded  by  the  subsets  to  give  the  answer  for  the 
whole  set.  Many  other  searching  problems,  like  the  range  searching  problem, 
are  decomposable  as  well.  Static  data  structures  for  decomposable  searching 
problems  can  be  turned  into  dynamic  structures  using  general  techniques.  For 
an  overview  see  the  book  by  Overmars  [299]. 

The  stabbing  query  problem  can  be  generalized  to  higher  dimensions.  Here 
we  are  given  a  collection  of  axis-parallel  (hyper-)rectangles  and  ask  for  those 
rectangles  that  contain  a  given  query  point.  To  solve  these  higher-dimensional 
stabbing  queries  we  can  use  multi-level  segment  trees.  The  structure  uses 
O{n\ogd~{  n)  storage  and  stabbing  queries  can  be  answered  in  0(\ogd  n)  time. 
The  use  of  fractional  cascading — see  Chapter  5 — lowers  the  query  time  bound 
by  a  logarithmic  factor.  The  use  of  the  interval  tree  on  the  deepest  level  of 
associated  structures  lowers  the  storage  bound  with  a  logarithmic  factor.  A 
higher-dimensional  version  of  the  interval  tree  and  priority  search  tree  doesn’t 
exist,  that  is,  there  is  no  clear  extension  of  the  structure  that  solves  the  analogous 
query  problem  in  higher  dimensions.  But  the  structures  can  be  used  as  associated 
structure  of  segment  trees  and  range  trees.  This  is  useful,  for  instance,  to  solve 
the  stabbing  query  problem  in  sets  of  axis-parallel  rectangles,  and  for  range 
searching  with  ranges  of  the  form  [jc  :  x1}  x  [y  :  /]  x  [z  :  +°°). 

In  geographic  information  systems,  the  most  widely  used  geometric  data 
structure  is  the  R-tree  [204].  It  is  a  2-dimensional  extension  of  the  well-known 
B-tree,  suitable  for  storage  on  disk.  On  disk,  memory  is  partitioned  in  blocks  of 
some  size  B ,  and  the  idea  is  to  have  nodes  of  high  degree  that  fit  exactly  into 
one  block.  An  internal  node  gives  rise  to  a  multi-way  split  instead  of  just  a 
binary  split,  allowing  the  tree  to  be  much  less  deep.  Queries  that  follow  only  one 
path  down  the  tree  require  fewer  blocks  from  disk,  and  are  therefore  answered 
more  efficiently  than  with  a  binary  tree.  An  R-tree  can  store  any  set  of  objects 
(points,  line  segments,  polygons),  and  answer  intersection  queries  with  any 
query  object.  An  R-tree  uses  linear  storage,  but  the  worst-case  query  time  is 
also  linear,  making  its  performance  nothing  better  than  a  simple  linked  list,  in 
theory.  In  practice,  however,  R-trees  perform  well.  A  few  theoretical  results  on 
R-trees  are  known.  It  was  shown  in  [5]  that  any  type  of  R-tree  built  on  n  points 
in  ^/-dimensional  space  has  to  visit  Cl((n/B)l~l^d  +  k/B)  blocks  (where  k  is  the 
number  of  answers).  A  variation  of  the  R-tree,  called  the  PR-tree  [18],  attains 
this  bound  if  both  the  objects  stored  and  the  query  objects  are  axis-parallel 
hyperrectangles.  For  an  extensive  overview  of  R-trees  and  related  structures, 
see  [335], 
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10.5  Exercises 

10.1  In  Section  10.1  we  solved  the  problem  of  finding  all  horizontal  line 
segments  in  a  set  that  intersect  a  vertical  segment.  For  this  we  used  an 
interval  tree  with  priority  search  trees  as  associated  structures.  There 
is  also  an  alternative  approach.  We  can  use  a  1  -dimensional  range  tree 
on  the  v-coordinate  of  the  segments  to  determine  those  segments  whose 
v-coordinate  lies  in  the  v-range  of  the  query  segment.  The  resulting 
segments  cannot  lie  above  or  below  the  query  segment,  but  they  may 
lie  completely  to  the  left  or  to  the  right  of  it.  We  get  those  segments 
in  (9(log«)  canonical  subsets.  For  each  of  these  subsets  we  use  as  an 
associated  structure  an  interval  tree  on  the  x-coordinates  to  find  the 
segments  that  actually  intersect  the  query  segment. 

a.  Give  the  algorithm  in  pseudocode. 

b.  Prove  that  the  data  structure  correctly  solves  the  queries. 

c.  What  are  the  bounds  for  preprocessing  time,  storage,  and  query  time 
of  this  stmcture?  Prove  your  answers. 

10.2  Let  P  be  a  set  of  n  points  in  the  plane,  sorted  on  v-coordinate.  Show 
that,  because  P  is  sorted,  a  priority  search  tree  of  the  points  in  P  can  be 
constructed  in  0(n)  time. 

10.3  In  the  exercisepart  of  the  algorithms  for  priority  search  trees  we  assumed 
that  all  points  have  distinct  coordinates.  It  was  indicated  that  this  re¬ 
striction  can  be  removed  by  using  composite  numbers  as  described  in 
Section  5.5.  Show  that  all  basic  operations  required  in  building  and 
querying  priority  search  trees  can  indeed  be  performed  with  composite 
numbers. 

10.4  Windowing  queries  in  sets  of  non-intersecting  segments  are  performed 
using  a  range  query  on  the  set  of  endpoints  and  intersection  queries  with 
the  four  boundaries  of  the  window.  Explain  how  to  avoid  that  segments 
are  reported  more  than  once.  To  this  end,  make  a  list  of  all  possible  ways 
in  which  an  arbitrarily  oriented  segment  can  intersect  a  query  window. 

10.5  In  this  exercise  you  must  show  how  the  data  structure  of  Theorem  10. 13 
can  be  built  in  0(ii  log/;)  time.  The  associated  structures  that  were 
used  are  binary  search  trees  on  the  vertical  order  of  the  segments.  If 
the  vertical  order  is  given,  then  an  associated  structure  can  be  built  in 
linear  time.  So  it  remains  to  compute  a  sorted  list  of  segments  for  each 
canonical  subset  in  O(nlogn)  time  in  total. 

Let  S  be  a  set  of  n  disjoint  segments  in  the  plane.  For  two  segments 
s,  s'  £  S,  we  say  that  ,v  lies  below  s',  denoted  s  -<  s',  if  and  only  if  there 
are  points  p  £  s,  p'  G  s'  with  px  =  p'x  and  py  <  p'y. 

a.  Prove  that  the  relation  -<  defines  an  acyclic  relation  on  S.  In  other 
words,  you  must  prove  that  an  order  si,S2,  ■  ■  ■  ,sn  for  the  segments  in 
S  exists  such  that  i  >  j  implies  Sj  sj. 
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b.  Describe  an  0{n  logn)  algorithm  to  compute  such  an  order.  Hint:  Use 
plane  sweep  to  find  all  segments  that  are  vertically  adjacent,  construct 
a  directed  graph  on  these  adjacencies,  and  apply  topological  sorting 
on  the  graph. 

c.  Explain  how  to  obtain  the  sorted  lists  for  the  canonical  subsets  in  the 
segment  tree  using  this  acyclic  order. 

10.6  Let  I  be  a  set  of  intervals  on  the  real  line.  We  want  to  be  able  to  count  the 
number  of  intervals  containing  a  query  point  in  0(log«)  time.  Thus,  the 
query  time  must  be  independent  of  the  number  of  segments  containing 
the  query  point. 

a.  Describe  a  data  structure  for  this  problem  based  on  segment  trees, 
which  uses  only  0{n)  storage.  Analyze  the  preprocessing  time,  and 
the  query  time  of  the  data  structure. 

b.  Describe  a  data  structure  for  this  problem  based  on  interval  trees.  You 
should  replace  the  lists  associated  with  the  nodes  of  the  interval  tree 
with  other  structures.  Analyze  the  amount  of  storage,  preprocessing 
time,  and  the  query  time  of  the  data  structure. 

c.  Describe  a  data  structure  for  this  problem  based  on  a  simple  binary 
search  tree.  Your  structure  should  have  O(n)  storage  and  O(logn) 
query  time.  (Hence,  segment  trees  are  actually  not  needed  to  solve 
this  problem  efficiently.) 

10.7  a.  We  want  to  solve  the  following  query  problem:  Given  a  set  S  of  n 

disjoint  line  segments  in  the  plane,  determine  those  segments  that 
intersect  a  vertical  ray  running  from  a  point  (qx,qy)  vertically  up¬ 
wards  to  infinity.  Describe  a  data  structure  for  this  problem  that  uses 
O(n\ogn)  storage  and  has  a  query  time  of  O(\ogn  +k),  where  k  is 
the  number  of  reported  answers. 

b.  Now  suppose  we  only  want  to  report  the  first  segment  hit  by  the  query 
ray.  Describe  a  data  structure  for  this  problem  with  0(n)  expected 
storage  and  O(k)gii)  expected  query  time.  Hint:  Apply  the  locus 
approach. 

10.8  Segment  trees  can  be  used  for  multi-level  data  structures. 

a.  Let  R  be  a  set  of  n  axis-parallel  rectangles  in  the  plane.  Design  a  data 
structure  for  R  such  that  the  rectangles  containing  a  query  point  q 
can  be  reported  efficiently.  Analyze  the  amount  of  storage  and  the 
query  time  of  your  data  structure.  Hint:  Use  a  segment  tree  on  the 
x- intervals  of  the  rectangles,  and  store  canonical  subsets  of  the  nodes 
in  this  segment  tree  in  an  appropriate  associated  structure. 

b.  Generalize  this  data  structure  to  t/-dimensional  space.  Here  we  are 
given  a  set  of  axis-parallel  hyperrectangles — that  is,  polytopes  of  the 
form  [xi  :  x'j]  x  [x2  :  x^]  x  •  •  •  x  [x(i :  xk\ — and  we  want  to  report  the 
hyperrectangles  containing  a  query  point.  Analyze  the  amount  of 
storage  and  the  query  time  of  your  data  structure. 
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10.9  Let  I  be  a  set  of  intervals  on  the  real  line.  We  want  to  store  these  intervals 
such  that  we  can  efficiently  determine  those  intervals  that  are  completely 
contained  in  a  given  interval  [x  :  x'].  Describe  a  data  structure  that  uses 
O(nlogn)  storage  and  solves  such  queries  in  O(log/i  +  k)  time,  where  k 
is  the  number  of  answers.  Hint:  Use  a  range  tree. 

10.10  Again  we  have  a  collection  I  of  intervals  on  the  real  line,  but  this  time  we 
want  to  efficiently  determine  those  intervals  that  contain  a  given  interval 
[x  :  x!\.  Describe  a  data  structure  that  uses  0(n)  storage  and  solves  such 
queries  in  O(log/i  +  k)  time,  where  k  is  the  number  of  answers.  Hint: 
Use  a  priority  search  tree. 

10.1 1  Consider  the  following  alternative  approach  to  solve  the  2-dimensional 
range  searching  problem:  We  construct  a  balanced  binary  search  tree  on 
the  x-coordinate  of  the  points.  For  a  node  v  in  the  tree,  let  P(  v)  be  the 
set  of  points  stored  in  the  subtree  rooted  at  V.  For  each  node  v  we  store 
two  associated  priority  search  trees  of  P(v),  a  tree  Tieft(v)  allowing 
for  range  queries  that  are  unbounded  to  the  left,  and  a  tree  Tr;ght(v)  for 
range  queries  that  are  unbounded  to  the  right. 

A  query  with  a  range  [x :  x7]  x  \y  :  /]  is  performed  as  follows.  We  search 
for  the  node  vspijt  where  the  search  paths  toward  x  and  x '  split  in  the  tree. 
Now  we  perform  a  query  with  range  [x :  +°°)  x  [y  :  y'\  on  Trjght(7c(vspiit)) 
and  a  query  with  range  (— °°  :  x7]  x  [y  :  /]  on  Tieft(rc(vspiit)).  This  gives 
all  the  answers  (there  is  no  need  to  search  further  down  in  the  tree!). 

a.  Prove  that  the  data  structure  correctly  solves  range  queries. 

b.  What  are  the  bounds  for  preprocessing  time,  storage,  and  query  time 
of  this  structure?  Prove  your  answers. 

10.12  a.  In  Section  10.3  an  algorithm  was  given  to  insert  an  interval  into 

a  segment  tree  (assuming  its  endpoints  are  already  present  in  the 
skeleton  of  the  tree).  Show  that  it  is  also  possible  to  delete  intervals 
in  O(\ogn)  time.  (You  may  need  to  maintain  some  extra  information 
for  this.) 

b.  Let  P  =  {p\, . . .  ,p„}  be  a  set  of  n  points  in  the  plane,  and  let  R  = 
{ri , . . . ,  r„ }  be  a  set  of  n  possibly  intersecting  rectangles.  Give  an 
algorithm  to  report  all  pairs  Pi,rj  such  that  p,  £  rj.  Your  algorithm 
should  run  in  0{n  log  n  +  k)  time,  where  k  is  the  number  of  reported 
pairs. 
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11  Convex  Hulls 


Mixing  Things 


The  output  of  oil  wells  is  a  mixture  of  several  different  components,  and  the 
proportions  of  these  components  vary  between  different  sources.  This  can 
sometimes  be  exploited:  by  mixing  together  the  output  of  different  wells,  one 
can  produce  a  mixture  with  proportions  that  are  particularly  favorable  for  the 
refining  process. 

Let’s  look  at  an  example.  For  simplicity  we  assume  that  we  are  only 
interested  in  two  of  the  components — call  them  A  and  B — of  our  product. 
Assume  that  we  are  given  a  mixture  with  10%  of  component  A  and  35%  of 
component  B ,  and  another  mixture  go  with  16%  of  A  and  20%  of  B.  Assume 
further  that  what  we  really  need  is  a  mixture  that  contains  12%  of  A  and  30% 
of  B.  Can  we  produce  this  mixture  from  the  given  ones?  Yes,  mixing  gt  and  go 
in  the  ratio  2  :  1  gives  the  desired  product.  Flowever,  it  is  impossible  to  make 
a  mixture  of  c,\  and  that  contains  13%  of  A  and  22%  of  B.  But  if  we  have  a 
third  mixture  g^  containing  7%  of  A  and  15%  of  B,  then  mixing  g\ ,  <52,  and  g^ 
in  the  ratio  of  1  :  3  :  1  will  give  the  desired  result. 

What  has  all  this  to  do  with  geometry?  This  becomes  clear  when  we 
represent  the  mixtures  |i,  <*2,  and  g-\  by  points  in  the  plane,  namely  by  p\  := 
(0.1,0.35),  p2  :=  (0.16,0.2),  and  po,  :=  (0.07,0.15).  Mixing  E,\  and  tgo  in  the 
ratio  2  :  1  gives  the  mixture  represented  by  the  point  q  (2/3)p\  +  (1  /3)p2- 
This  is  the  point  on  the  segment  p\p2  such  that  dist(/?2,<?)  :  distC/.  p  \ )  =2:1, 
where  dist(.,.)  denotes  the  distance  between  two  points.  More  generally,  by 
mixing  g\  and  go  in  varying  ratios,  we  can  produce  the  mixtures  represented 
by  any  point  on  the  line  segment  p\p2-  If  we  start  with  the  three  base  mixtures 
4i,  c,o_,  and  1*3,  we  can  produce  any  point  in  the  triangle  p\  P2P:\-  For  instance, 
mixing  tg\,  tg 2 ,  and  g-\  in  the  ratio  1:3:1  gives  the  mixture  represented  by  the 
point  (l/5)p!  +  (3/5)/>2  +  (l/5)p3  =  (0.13,0.22). 

What  happens  if  we  don’t  have  three  but  n  base  mixtures,  for  some  n  >  3, 
represented  by  points  p\,p2,-..,PrP-  Suppose  that  we  mix  them  in  the  ratio 
l\  :  h  :  •  •  •  :  /„.  Let  L  :=  Yl}=  1  h  and  let  K  ;=  h/L.  Note  that 

n 

Xj  ^  0  for  all  i  and  ^  A,  =  1 . 

i=  1 

The  mixture  we  get  by  mixing  the  base  mixtures  in  the  given  ratio  is  the  one 


(0.1,0.35) 
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represented  by 

i^Pi. 

i=  1 

Such  a  linear  combination  of  the  points  p,  where  the  A,  satisfy  the  conditions 
stated  above — each  A;  is  non-negative,  and  the  sum  of  the  A,-  is  one — is  called  a 
convex  combination.  In  Chapter  1  we  defined  the  convex  hull  of  a  set  of  points 
as  the  smallest  convex  set  containing  the  points  or,  more  precisely,  as  the  inter¬ 
section  of  all  convex  sets  containing  the  points.  One  can  show  that  the  convex 
hull  of  a  set  of  points  is  exactly  the  set  of  all  possible  convex  combinations  of 
the  points.  We  can  therefore  test  whether  a  mixture  can  be  obtained  from  the 
base  mixtures  by  computing  the  convex  hull  of  their  representative  points,  and 
checking  whether  the  point  representing  the  mixture  lies  inside  it. 

What  if  there  are  more  than  two  interesting  components  in  the  mixtures? 
Well,  what  we  have  said  above  remains  true;  we  just  have  to  move  to  a  space  of 
higher  dimension.  More  precisely,  if  we  want  to  take  d  components  into  account 
we  have  to  represent  a  mixture  by  a  point  in  ^/-dimensional  space.  The  convex 
hull  of  the  points  representing  the  base  mixtures,  which  is  a  convex  polytope, 
represents  the  set  of  all  possible  mixtures. 

Convex  hulls — in  particular  convex  hulls  in  3-dimensional  space — are  used  in 
various  applications.  For  instance,  they  are  used  to  speed  up  collision  detection 
in  computer  animation.  Suppose  that  we  want  to  check  whether  two  objects  Ii 
and  lJ>2  intersect.  If  the  answer  to  this  question  is  negative  most  of  the  time,  then 
the  following  strategy  pays  off.  Approximate  the  objects  by  simpler  objects 
I]  and  IS  that  contain  the  originals.  If  we  want  to  check  whether  IP]  and  IPo 
intersect,  we  first  check  whether  Ii  and  IS  intersect;  only  if  this  is  the  case  do 
we  need  to  perform  the — supposedly  more  costly — test  on  the  original  objects. 

There  is  a  trade-off  in  the  choice  of  the  approximating  objects.  On  the  one 
hand,  we  want  them  to  be  simple  so  that  intersection  tests  are  cheap.  On  the  other 
hand,  simple  approximations  most  likely  do  not  approximate  the  original  objects 
very  well,  so  there  is  a  bigger  chance  we  have  to  test  the  originals.  Bounding 
spheres  are  on  one  side  of  the  spectrum;  intersection  tests  for  spheres  are  quite 
simple,  but  for  many  objects  spheres  do  not  provide  a  good  approximation. 
Convex  hulls  are  more  on  the  other  side  of  the  spectrum:  intersection  tests  for 
convex  hulls  are  more  complicated  than  for  spheres — but  still  simpler  than  for 
non-convex  objects — but  convex  hulls  can  approximate  most  objects  a  lot  better. 


11.1  The  Complexity  of  Convex  Hulls  in  3-Space 

In  Chapter  1  we  have  seen  that  the  convex  hull  of  a  set  P  of  n  points  in  the  plane 
is  a  convex  polygon  whose  vertices  are  points  in  P.  Hence,  the  convex  hull 
has  at  most  n  vertices.  In  3-dimensional  space  a  similar  statement  is  true:  the 
convex  hull  of  a  set  P  of  n  points  is  a  convex  polytope  whose  vertices  are  points 

_  in  P  and.  hence,  it  has  at  most  n  vertices.  In  the  planar  case  the  bound  on  the 
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is  linear,  since  the  number  of  edges  of  a  planar  polygon  is  equal  to  the  number 
of  vertices.  In  3-space  this  is  no  longer  true;  the  number  of  edges  of  a  polytope 
can  be  higher  than  the  number  of  vertices.  But  fortunately  the  difference  cannot 
be  too  large,  as  follows  from  the  following  theorem  on  the  number  of  edges  and 
facets  of  convex  polytopes.  (Formally,  a  facet  of  a  convex  polytope  is  defined 
to  be  a  maximal  subset  of  coplanar  points  on  its  boundary.  A  facet  of  a  convex 
polytope  is  necessarily  a  convex  polygon.  An  edge  of  a  convex  polytope  is  an 
edge  of  one  of  its  facets.) 

Theorem  11.1  Let  !P  be  a  convex  polytope  with  n  vertices.  The  number  of 
edges  of  CP  is  at  most  3  n  —  6,  and  the  number  of  facets  of  CP  is  at  most  2  n  —  4. 

Proof.  Recall  that  Euler’s  formula  states  for  a  connected  planar  graph  with  n 
nodes,  ne  arcs,  and  tif  faces  the  following  relation  holds: 

n  —  ne  +  rif  =  2. 

Since  we  can  interpret  the  boundary  of  a  convex  polytope  as  a  planar  graph — see 
Figure  11.1 — the  same  relation  holds  for  the  numbers  of  vertices,  edges,  and 
facets  in  a  convex  polytope.  (In  fact,  Euler’s  formula  was  originally  stated  in 


terms  of  polytopes,  not  in  terms  of  planar  graphs.)  Every  face  of  the  graph 
corresponding  to  CP  has  at  least  three  arcs,  and  every  arc  is  incident  to  two  faces, 
so  we  have  2 ne  f  3 n/.  Plugging  this  into  Euler’s  formula  we  get 

n  +  rif  —  2  ^  3tif/2, 

so  rif  ^  2 n  —  4.  Applying  Euler’s  formula  once  more,  we  see  that  ne  f  3 n  —  6. 
For  the  special  case  that  every  facet  is  a  triangle — the  case  of  a  simplicial 
polytope — the  bounds  on  the  number  of  edges  and  facets  of  an  n-vertex  polytope 
are  exact,  because  then  2 ne  =  3 n /.  0 

Theorem  11.1  also  holds  for  non-convex  polytopes  whose  so-called  genus 
is  zero,  that  is,  polytopes  without  holes  or  tunnels;  for  polytopes  of  larger  genus 
similar  bounds  hold.  Since  this  chapter  deals  with  convex  hulls,  however,  we 
refrain  from  defining  what  a  (non-convex)  polytope  exactly  is,  which  we  would 
need  to  do  to  prove  the  theorem  in  the  non-convex  case. 
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edge 


Figure  11.1 

A  cube  interpreted  as  a  planar  graph: 
note  that  one  facet  maps  to  the 
unbounded  face  of  the  graph 
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Chapter  11  If  we  combine  Theorem  11.1  with  the  earlier  observation  that  the  convex 
convex  hulls  hull  of  a  set  of  points  in  3-space  is  a  convex  polytope  whose  vertices  are  points 

in  P,  we  get  the  following  result. 

Corollary  11.2  The  complexity  of  the  convex  hull  of  a  set  of  n  points  in  three- 
dimensional  space  is  0(n). 


11.2  Computing  Convex  Hulls  in  3-Space 

Let  P  be  a  set  of  n  points  in  3-space.  We  will  compute  CJf(P),  the  convex  hull 
of  P,  using  a  randomized  incremental  algorithm,  following  the  paradigm  we 
have  met  before  in  Chapters  4,  6,  and  9. 

The  incremental  construction  starts  by  choosing  four  points  in  P  that  do  not  lie 
in  a  common  plane,  so  that  their  convex  hull  is  a  tetrahedron.  This  can  be  done 
as  follows.  Let  p\  and  /u  be  two  points  in  P.  We  walk  through  the  set  P  until 
we  find  a  point  p 3  that  does  not  lie  on  the  line  through  p\  and  pi .  We  continue 
searching  P  until  we  find  a  point  p4  that  does  not  lie  in  the  plane  through  pi,  P2, 
and  pi.  (If  we  cannot  find  four  such  points,  then  all  points  in  P  lie  in  a  plane.  In 
this  case  we  can  use  the  planar  convex  hull  algorithm  of  Chapter  1  to  compute 
the  convex  hull.) 

Next  we  compute  a  random  permutation  p$, . . .  ,p„  of  the  remaining  points. 
We  will  consider  the  points  one  by  one  in  this  random  order,  maintaining  the 
convex  hull  as  we  go.  For  an  integer  r  ip  \,  let  P,  :=  {p\ . . . .  ,pr}.  In  a  generic 
step  of  the  algorithm,  we  have  to  add  the  point  pr  to  the  convex  hull  of  P,  1 , 
that  is,  we  have  to  transform  CJf(F’r-i)  into  CJf (Pr).  There  are  two  cases. 

■  If  pr  lies  inside  CTf(.P,-_i),  or  on  its  boundary,  then  Q3~C(Pr)  =  C!K(P,-_i), 
and  there  is  nothing  to  be  done. 


Figure  11.2 
The  horizon  of  a  polytope 
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Now  suppose  that  pr  lies  outside  Imagine  that  you  are  standing 

at  pr,  and  that  you  are  looking  at  G‘Ji  (Pr  1).  You  will  be  able  to  see  some 


facets  of  C!K(P,  _i) — the  ones  on  the  front  side — but  others  will  be  invisible 
because  they  are  on  the  back  side.  The  visible  facets  form  a  connected  region 
on  the  surface  of  CTf(Pr-i),  called  the  visible  region  of  pr  on  CTC(.P,— l), 
which  is  enclosed  by  a  closed  curve  consisting  of  edges  of  G'K(Pr_\).  We 
call  this  curve  the  horizon  of  pr  on  CTf  (Pr_ 1 ) .  As  you  can  see  in  Figure  1 1 .2, 
the  projection  of  the  horizon  is  the  boundary  of  the  convex  polygon  obtained 
by  projecting  G3~C(Pr^\)  onto  a  plane,  with  pr  as  the  center  of  projection. 
What  exactly  does  “visible”  mean  geometrically?  Consider  the  plane  h  / 
containing  a  facet  /  of  G'K(Pr .  | ).  By  convexity,  G'K(Pr  _  i j  is  completely 
contained  in  one  of  the  closed  half-spaces  defined  by  hr.  The  face  /  is 
visible  from  a  point  if  that  point  lies  in  the  open  half-space  on  the  other  side 
of  Ilf. 

The  horizon  of  pr  plays  a  crucial  role  when  we  want  to  transform  CTf(T>r-i) 
to  Q'K(Pr)\  it  forms  the  border  between  the  part  of  the  boundary  that  can 
be  kept — the  invisible  facets — and  the  part  of  the  boundary  that  must  be 
replaced — the  visible  facets.  The  visible  facets  must  be  replaced  by  facets 
connecting  pr  to  its  horizon. 

Before  we  go  into  more  details,  we  should  decide  how  we  are  going  to  represent 
the  convex  hull  of  points  in  space.  As  we  observed  before,  the  boundary  of  a 
3-dimensional  convex  polytope  can  be  interpreted  as  a  planar  graph.  Therefore 
we  store  the  convex  hull  in  the  form  of  a  doubly-connected  edge  list,  a  data 
structure  developed  in  Chapter  2  for  storing  planar  subdivisions.  The  only 
difference  is  that  vertices  will  now  be  3 -dimensional  points.  We  will  keep  the 
convention  that  the  half-edges  are  directed  such  that  the  ones  bounding  any  face 
form  a  counterclockwise  cycle  when  seen  from  the  outside  of  the  polytope. 


etK(Pr_i) 


eK(pr) 

Back  to  the  addition  of  pr  to  the  convex  hull.  We  have  a  doubly-connected 
edge  list  representing  G‘J((P,- .  i ),  which  we  have  to  transform  into  a  doubly- 
connected  edge  list  for  C5f(Pr).  Suppose  that  we  knew  all  facets  of  G,M.(Pr .  i ) 
visible  from  pr.  Then  it  would  be  easy  to  remove  all  the  information  stored 
for  these  facets  from  the  doubly-connected  edge  list,  compute  the  new  facets 
connecting  pr  to  the  horizon,  and  store  the  information  for  the  new  facets  in  the 
doubly-connected  edge  list.  All  this  will  take  linear  time  in  the  total  complexity 
of  the  facets  that  disappear. 

There  is  one  subtlety  we  should  take  care  of  after  the  addition  of  the  new 
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f  is  visible  from  p, 
but  not  from  q 


Figure  11.3 

Adding  a  point  to  the  convex  hull 
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conflicts 


facets:  we  have  to  check  whether  we  have  created  any  coplanar  facets.  This 
happens  if  pr  lies  in  the  plane  of  a  face  of  Q'K(Pr-\).  Such  a  face  /  is  not  visible 
from  pr  by  our  definition  of  visibility  above.  Hence,  /  will  remain  unchanged, 
and  we  will  add  triangles  connecting  pr  to  the  edges  of  /  that  are  part  of  the 
horizon.  Those  triangles  are  coplanar  with  /,  and  so  they  have  to  be  merged 
with  /  into  one  facet. 

In  the  discussion  so  far  we  have  ignored  the  problem  of  finding  the  facets  of 
C5f(Pr_i)  that  are  visible  from  pr.  Of  course  this  could  be  done  by  testing 
every  facet.  Since  such  a  test  takes  constant  time — we  have  to  check  to  which 
side  of  a  given  plane  the  point  p,  lies — we  can  find  all  visible  facets  in  O(r) 
time.  This  would  lead  to  an  0{n2)  algorithm.  Next  we  show  how  to  do  better. 

The  trick  is  that  we  are  going  to  work  ahead:  besides  the  convex  hull  of 
the  current  point  set  we  shall  maintain  some  additional  information,  which  will 
make  it  easy  to  find  the  visible  facets.  In  particular,  we  maintain  for  each  facet 
/  of  the  current  convex  hull  Q‘K{Pr)  a  set  PConflict(/)  C  {pr+1,pr+2,...,p„} 
containing  the  points  that  can  see  /.  Conversely,  we  store  for  every  point  pt, 
with  t  >  r,  the  set  T’conffictCf’f)  of  facets  of  CTC(H,-)  visible  from  pr.  We  will  say 
that  a  point  p  £  PConflict(/)  is  in  conflict  with  the  facet  /,  because  p  and  /  cannot 
peacefully  live  together  in  the  convex  hull — once  we  add  a  point  p  £  Hconflict(/) 
to  the  convex  hull,  the  facet  /  must  go.  We  call  HConfiict(/)  and  T’conflictCp?) 
conflict  lists. 

We  maintain  the  conflicts  in  a  so-called  conflict  graph,  which  we  denote 
by  9-  The  conflict  graph  is  a  bipartite  graph.  It  has  one  node  set  with  a  node  for 
every  point  of  P  that  has  not  been  inserted  yet,  and  one  node  set  with  a  node 
for  every  facet  of  the  current  convex  hull.  There  is  an  arc  for  every  conflict 
between  a  point  and  a  facet.  In  other  words,  there  is  an  arc  between  a  point 
pt  £  P  and  facet  /  of  CTf(P,)  if  r  <  t  and  /  is  visible  from  pt.  Using  the  conflict 
graph  S,  we  can  report  the  set  T’COnfiict(pr)  for  a  given  point  p,  (or  PCOnflict(/)  for 
a  given  facet  /)  in  time  linear  in  its  size.  This  means  that  when  we  insert  pr 
into  all  we  have  to  do  is  to  look  up  /Vonfiict(Pr)  in  9  to  get  the  visible 

facets,  which  we  can  then  replace  by  the  new  convex  hull  facets  connecting  pr 
to  the  horizon. 


Initializing  the  conflict  graph  9  for  CTCf/li)  can  be  done  in  linear  time:  we 
simply  walk  through  the  list  of  points  P  and  determine  which  of  the  four  faces 
of  CTC(P4)  they  can  see. 

To  update  9  after  adding  a  point  pr,  we  first  discard  the  nodes  and  incident 
arcs  for  all  the  facets  of  CTH'(7V- 1  j  that  disappear  from  the  convex  hull.  These 
are  the  facets  visible  from  pr,  which  are  exactly  the  neighbors  of  pr  in  9,  so 
this  is  easy.  We  also  discard  the  node  for  We  then  add  nodes  to  9  for  the 
new  facets  we  created,  which  connect  pr  to  the  horizon.  The  essential  step  is  to 
find  the  conflict  lists  of  these  new  facets.  No  other  conflicts  have  to  be  updated: 
the  conflict  set  HConflict(/)  of  a  facet  /  that  is  unaffected  by  the  insertion  of  pr 
remains  unchanged. 

_  The  facets  created  by  the  insertion  of  pr  are  all  triangles,  except  for  those 
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of  the  latter  type  is  trivial  to  find:  it  is  the  same  as  the  conflict  list  of  the  existing 
facet,  since  the  merging  does  not  change  the  plane  containing  the  facet.  So  let’s 
look  at  one  of  the  new  triangles  /  incident  to  pr  in  Q‘K(Pr).  Suppose  that  a  point 
pt  can  see  /.  Then  p,  can  certainly  see  the  edge  e  of  /  that  is  opposite  pr.  This 
edge  e  is  a  horizon  edge  of  pr.  and  it  was  already  present  in  G,'.K(Pr  |).  Since 
CTC(T’r-i)  C  CK(Pr),  the  edge  e  must  have  been  visible  from  pt  in  G3~C(Pr^i) 
as  well.  That  can  only  be  the  case  if  one  of  the  two  facets  incident  to  e  in 
G“K(Pr- 1)  is  visible  from  pt.  This  implies  that  the  conflict  list  of  /  can  be  found 
by  testing  the  points  in  the  conflict  lists  of  the  two  facets  f\  and  /?  that  were 
incident  to  the  horizon  edge  e  in  Q‘K(Pr^\). 
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We  stated  earlier  that  we  store  the  convex  hull  as  a  doubly-connected  edge  list, 
so  changing  the  convex  hull  means  changing  the  information  in  the  doubly- 
connected  edge  list.  To  keep  the  code  short,  however,  we  have  omitted  all 
explicit  references  to  the  doubly-connected  edge  list  in  the  pseudocode  below, 
which  summarizes  the  convex  hull  algorithm. 

Algorithm  ConvexHullCP) 

Input.  A  set  P  of  n  points  in  three-space. 

Output.  The  convex  hull  G'K(P)  of  P. 

1 .  Find  four  points  p\  -  P2,  P4-P4  in  P  that  form  a  tetrahedron. 

2.  e<-  e^i({pup2,P3,P4}) 

3.  Compute  a  random  permutation  ps,pe, . . .  ,pn  of  the  remaining  points. 

4.  Initialize  the  conflict  graph  S  with  all  visible  pairs  ( ptlf ),  where  /  is  a 
facet  of  C  and  t  >  4. 

5.  for  r  < —  5  to  n 

6.  do  (*  Insert  pr  into  C:  *) 

7.  if  is  not  empty  (*  that  is,  pr  lies  outside  6  *) 

8.  then  Delete  all  facets  in  FCOnnict(Pr)  from  C. 

9.  Walk  along  the  boundary  of  the  visible  region  of  pr  (which 
consists  exactly  of  the  facets  in  T’conflict(T’r))  and  create  a  list 
L  of  horizon  edges  in  order. 


10. 

for  all  e  G  £ 

11. 

do  Connect  e  to  pr  by  creating  a  triangular  facet  /. 

12. 

if  /  is  coplanar  with  its  neighbor  facet  f  along  e 

13. 

then  Merge  /  and  f  into  one  facet,  whose  conflict 
list  is  the  same  as  that  of  /'. 

14. 

else  (*  Determine  conflicts  for  /:  *) 

15. 

Create  a  node  for  /  in  (-]• 

16. 

Let  f\  and  />  be  the  facets  incident  to  e  in  the 
old  convex  hull. 

17. 

Pip)  <  7>conflict(/l )  U-PconflictC/iz) 

18. 

for  all  points  p  £  P(e) 

19. 

do  If  /  is  visible  from  p ,  add  (p.f)  to  9- 

20. 

21.  return  6 

Delete  the  node  corresponding  to  p,  and  the  nodes  corre¬ 
sponding  to  the  facets  in  T’Confiict(.£v)  from  S,  together  with 
their  incident  arcs. 
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11.3*  The  Analysis 

As  usual  when  we  analyse  a  randomized  incremental  algorithm,  we  first  try  to 
bound  the  expected  structural  change.  For  the  convex  hull  algorithm  this  means 
we  want  to  bound  the  total  number  of  facets  created  by  the  algorithm. 

Lemma  11.3  The  expected  number  of  facets  created  by  ConvexHull  is  at 
most  6 n  —  20. 

Proof.  The  algorithm  starts  with  a  tetrahedron,  which  has  four  facets.  In  every 
stage  r  of  the  algorithm  where  pr  lies  outside  G'K(Pr~\),  new  triangular  facets 
connecting  pr  to  its  horizon  on  GT((Pr- 1)  are  created.  What  is  the  expected 
number  of  new  facets?  As  in  previous  occasions  where  we  analyzed  randomized 
algorithms,  we  use  backwards  analysis.  We  look  at  CK(Pr)  and  imagine  remov¬ 
ing  vertex  the  number  of  facets  that  disappear  due  to  the  removal  of  pr  from 
G3~C(Pr)  is  the  same  as  the  number  of  facets  that  were  created  due  to  the  insertion 
of  pr  into  G‘K{Pr-\).  The  disappearing  facets  are  exactly  the  ones  incident  to  pr , 
and  their  number  equals  the  number  of  edges  incident  to  pr  in  CTf(Pr).  We  call 
this  number  the  degree  of  pr  in  6 “K(Pr),  and  we  denote  it  by  deg(pr,  CTC(P,  )). 
We  now  want  to  bound  the  expected  value  of  deg(j?,.,  G‘K(Pr)). 

By  Theorem  11.1a  convex  polytope  with  r  vertices  has  at  most  3r  —  6  edges. 
This  means  that  the  sum  of  the  degrees  of  the  vertices  of  CTf(Pr),  which  is  a 
convex  polytope  with  r  or  less  vertices,  is  at  most  6r-  12.  Hence,  the  average 
degree  is  bounded  by  6  —  12/r.  Since  we  treat  the  vertices  in  random  order, 
it  seems  that  the  expected  degree  of  pr  is  bounded  by  6  —  12/r.  We  have  to 
be  a  little  bit  careful,  though:  the  first  four  points  are  already  fixed  when  we 
generate  the  random  permutation,  so  pr  is  a  random  element  of  { p$ . . . .  ,pr}, 
not  of  Pr.  Because  p\,...,p4  have  total  degree  at  least  12,  the  expected  value  of 
deg (pr,  G 3i(Pr))  is  bounded  as  follows: 

E[deg(pr,eTf(Pr))]  =  — Ytdeg(pi,GK(Pr)) 

r  **  i'=5 

<  ^({tdeg(;r,eW))]-12) 

6r  —  12  —  12 

<  - 7 -  =  6. 

r  —  4 

The  expected  number  of  facets  created  by  ConvexHull  is  the  number  of 
facets  we  start  with  (four)  plus  the  expected  total  number  of  facets  created 
during  the  additions  of  ps, . . .  ,pn  to  the  hull.  Hence,  the  expected  number  of 
created  facets  is 

n 

4+^E[deg(pr,e5f(Pr))]  <  4  +  6(n—4)  =6n-20.  □ 

r= 5 

Now  that  we  have  bounded  the  total  amount  of  structural  change  we  can 
bound  the  expected  running  time  of  the  algorithm. 
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Lemma  11.4  Algorithm  ConvexHull  computes  the  convex  hull  of  a  set  P  of  Section  11.3* 
n  points  in  R3  in  0{n  logn)  expected  time,  where  the  expectation  is  with  respect  the  analysis 
to  the  random  permutation  used  by  the  algorithm. 

Proof.  The  steps  before  the  main  loop  can  certainly  be  done  in  O(nlogn)  time. 

Stage  r  of  the  algorithm  takes  constant  time  if  Fconflict(f’r)  is  empty,  which  is 
when  pr  lies  inside,  or  on  the  boundary  of,  the  current  convex  hull. 

If  that  is  not  the  case,  most  of  stage  r  takes  <9(card(.Fconflict(p,.)))  time, 
where  card()  denotes  the  cardinality  of  a  set.  The  exceptions  to  this  are  the 
lines  17-19  and  line  20.  We  shall  bound  the  time  spent  in  these  lines  later; 
first,  we  bound  card(,FConffict(.fv))-  Note  that  card(T’conflict (/?,-))  is  the  number  of 
facets  deleted  due  to  the  addition  of  the  point  pr.  Clearly,  a  facet  can  only  be 
deleted  if  it  has  been  created  before,  and  it  is  deleted  at  most  once.  Since  the 
expected  number  of  facets  created  by  the  algorithm  is  0(n )  by  Lemma  11.3, 
this  implies  that  the  total  number  of  deletions  is  0(n)  as  well,  so 

n 

E[£  card(Fconflict(pr))]  =  0(n). 

r= 5 

Now  for  lines  17-19  and  line  20.  Line  20  takes  time  linear  in  the  number 
of  nodes  and  arcs  that  are  deleted  from  S.  Again,  a  node  or  arc  is  deleted  at 
most  once,  and  we  can  charge  the  cost  of  this  deletion  to  the  stage  where  we 
created  it.  It  remains  to  look  at  lines  17-19.  In  stage  r,  these  lines  are  executed 
for  all  horizon  edges,  that  is,  all  edges  in  L.  For  one  edge  e  £  L,  they  take 
0(card(P(<?)))  time.  Hence,  the  total  time  spent  in  these  lines  in  stage  r  is 
OCE.fr c  card(P(<?))).  To  bound  the  total  expected  running  time,  we  therefore 
have  to  bound  the  expected  value  of 

£card(P(e)), 

e 

where  the  summation  is  over  all  horizon  edges  that  appear  at  any  stage  of  the 
algorithm.  We  will  prove  below  that  this  is  O(n\ogn),  which  implies  that  the 
total  running  time  is  O(n\ogn).  0 

We  use  the  framework  of  configuration  spaces  from  Chapter  9  to  supply 
the  missing  bound.  The  universe  X  is  the  set  of  P ,  and  the  configurations  A 
correspond  to  convex  hull  edges.  However,  for  technical  reasons — in  particular, 
to  be  able  to  deal  correctly  with  degenerate  cases — we  attach  a  half-edge  to 
both  sides  of  the  edge.  To  be  more  precise,  a  flap  A  is  defined  as  an  ordered 
four-tuple  of  points  ( p,q,s,t )  that  do  not  all  lie  in  a  plane.  The  defining  set  D{  A) 
is  simply  the  set  {p,q,s,t}.  The  killing  set  K( A)  is  more  difficult  to  visualize. 

Denote  the  line  through  p  and  q  by  i.  Given  a  point  x,  let  li(t.x)  denote  the 
half-plane  bounded  by  i  that  contains  a.  Given  two  points  a,  y,  let  p  (A,y)  be  the 
half-line  starting  in  a  and  passing  through  y.  A  point  x  €  X  is  in  K( A)  if  and 
only  if  it  lies  in  one  of  the  following  regions: 

■  outside  the  closed  convex  3-dimensional  wedge  defined  by  h(£,  s)  and 
m  inside  h(£,  s)  but  outside  the  closed  2-dimensional  wedge  defined  by  p(p,q) 

and  p(p,s). 
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■  inside  h(£,t)  but  outside  the  closed  2-dimensional  wedge  defined  by  p{q,t) 
and  p(q,p), 

m  inside  the  line  £  but  outside  the  segment  ~pq , 

■  inside  the  half-line  p(p,s )  but  outside  the  segment  ps, 
m  inside  the  half-line  p(q,t)  but  outside  the  segment  qt. 

For  every  subset  S  CP,  we  define  the  set  T(S)  of  active  configurations — this  is 
what  we  want  to  compute — as  prescribed  in  Chapter  9:  As  'T(.S')  if  and  only  if 
D(A)  C  S  and  K(A)  (IS  =  0. 

Lemma  11.5  A  Bap  A  =  (p,q,s,t)  is  in  7(S)  if  and  only  if~pq,  ps,  and  qt  are 
edges  of  the  convex  hull  G Of(S),  there  is  a  facet  f\  incident  to  J>q  and  ps,  and  a 
different  facet  fi  incident  to  pq  and  qt.  Furthermore,  if  one  of  the  facets  f\  or 
f2  is  visible  from  a  point  x  £  P  then  x  £  K(A). 

We  leave  the  proof — which  involves  looking  precisely  at  the  cases  when 
points  are  collinear  or  coplanar,  but  which  is  otherwise  not  difficult — to  the 
reader. 


As  you  may  have  guessed,  the  flaps  take  over  the  role  of  the  horizon  edges. 

Lemma  11.6  The  expected  value  of  £ecard(P(e)),  where  the  summation  is 
over  all  horizon  edges  that  appear  at  some  stage  of  the  algorithm,  is  0(n  log  n). 

Proof.  Consider  an  edge  e  of  the  horizon  of  pr  on  GTC (Pr- 1 ) .  Let  A  =  (p,  q,  s,  t) 
be  one  of  the  two  flaps  with  ~pq  =  e.  By  Lemma  11.5,  A  €  7{Pr  i ) ,  and  the 
points  in  P\P,  that  can  see  one  of  the  facets  incident  to  e  are  all  in  K{ A),  so 
P(e)  C  K( A).  By  Theorem  9.15,  it  follows  that  the  expected  value  of 

£card(£(A)), 

A 

where  the  summation  is  over  all  flaps  A  appearing  in  at  least  one  T (Pr),  is 
bounded  by 

t,6(— ) 

The  cardinality  of  7{Pr)  is  twice  the  number  of  edges  of  G'K(Pr).  Therefore  it 
is  at  most  6 r—  12,  so  we  get  the  bound 

^card(F(e))  <  ^card(W(A))  <  ^  16^- — - ^  ^96«ln«.  0 

This  finishes  the  last  piece  of  the  analysis  of  the  convex  hull  algorithm.  We 
get  the  following  result: 
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Theorem  11.7  The  convex  hull  of  a  set  of  n  points  in  M3  can  be  computed  in 
0(nlogn)  randomized  expected  time. 


11.4*  Convex  Hulls  and  Half-Space  Intersection 


Section  11.4* 

CONVEX  HULLS  AND  HALF-SPACE 

In  Chapter  8  we  have  met  the  concept  of  duality.  The  strenth  of  duality  lies  in  intersection 

that  it  allows  us  to  look  at  a  problem  from  a  new  perspective,  which  can  lead  to 

more  insight  in  what  is  really  going  on.  Recall  that  we  denote  the  line  that  is 

the  dual  of  a  point  p  by  p" ,  and  the  point  that  is  the  dual  of  a  line  £  by  £* .  The 

duality  transform  is  incidence  and  order  preserving:  p  £  £  if  and  only  if  £*  £  p*, 

and  p  lies  above  £  if  and  only  if  £*  lies  above  p*. 

Let’s  have  a  closer  look  at  what  convex  hulls  correspond  to  in  dual  space. 

We  will  do  this  for  the  planar  case.  Let  P  be  a  set  of  points  in  the  plane.  For 
technical  reasons  we  focus  on  its  upper  convex  hull,  denoted  UJf(P),  which 
consists  of  the  convex  hull  edges  that  have  P  below  their  supporting  line — see 
the  left  side  of  Figure  1 1.4.  The  upper  convex  hull  is  a  polygonal  chain  that 
connects  the  leftmost  point  in  P  to  the  rightmost  one.  (We  assume  for  simplicity 
that  no  two  points  have  the  same  x-coordinate.) 

primal  plane  dual  plane 


WK(P) 


•  • 


Figure  11.4 

Upper  hulls  correspond  to  lower 
envelopes 


When  does  a  point  p  £  P  appear  as  a  vertex  of  the  upper  convex  hull?  That 
is  the  case  if  and  only  if  there  is  a  non-vertical  line  £  through  p  such  that  all 
other  points  of  P  lie  below  £.  In  the  dual  plane  this  statement  translates  to  the 
following  condition:  there  is  a  point  £*  on  the  line  p*  £  P*  such  that  £*  lies 
below  all  other  lines  of  P* .  If  we  look  at  the  arrangement  A(P*),  this  means  that 
p*  contributes  an  edge  to  the  unique  bottom  cell  of  the  arrangement.  This  cell  is 
the  intersection  of  the  half-planes  bounded  by  a  line  in  P*  and  lying  below  that 
line.  The  boundary  of  the  bottom  cell  is  an  x- monotone  chain.  We  can  define 
this  chain  as  the  minimum  of  the  linear  functions  whose  graphs  are  the  lines  in 
P* .  For  this  reason,  the  boundary  of  the  bottom  cell  in  an  arrangement  is  often 
called  the  lower  envelope  of  the  set  of  lines.  We  denote  the  lower  envelope  of 
P*  by  ££(£**) — see  the  right  hand  side  of  Figure  1 1.4. 

The  points  in  P  that  appear  on  ILIK(P)  do  so  in  order  of  increasing  x- 
coordinate.  The  lines  of  P*  appear  on  the  boundary  of  the  bottom  cell  in  order 
of  decreasing  slope.  Since  the  slope  of  the  line  p*  is  equal  to  the  x-coordinate  of 
p,  it  follows  that  the  left-to-right  list  of  points  on  ll'.K(P)  corresponds  exactly 
to  the  right-to-left  list  of  edges  of  ££(_P*).  So  the  upper  convex  hull  of  a  set  of 
points  is  essentially  the  same  as  the  lower  envelope  of  a  set  of  lines. 

Let’s  do  one  final  check.  Two  points  p  and  q  in  P  form  an  upper  convex 
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hull  edge  if  and  only  if  all  other  points  in  P  lie  below  the  line  £  through  p  and 
q.  In  the  dual  plane,  this  means  that  all  lines  r*,  with  r  S  P\  {p,q},  lie  above 
the  intersection  point  £*  of  p*  and  q* .  This  is  exactly  the  condition  under  which 
p*  fl  q*  is  a  vertex  of  L£(P*). 

What  about  the  lower  convex  hull  of  P  and  the  upper  envelope  of  P*1  (We 
leave  the  precise  definitions  to  the  reader.  )  By  symmetry,  these  concepts  are 
dual  to  each  other  as  well. 

We  now  know  that  the  intersection  of  lower  half-planes — half-planes  bounded 
from  above  by  a  non-vertical  line — can  be  computed  by  computing  an  upper 
convex  hull,  and  that  the  intersection  of  upper  half-planes  can  be  computed  by 
computing  a  lower  convex  hull.  But  what  if  we  want  to  compute  the  intersection 
of  an  arbitrary  set  H  of  half-planes?  Of  course,  we  can  split  the  set  H  into  a 
set  H 1  of  upper  half-planes  and  a  set  H  of  lower  half-planes,  compute  [J  II 
by  computing  the  lower  convex  hull  of  H  .*  and  \  J  II  by  computing  the  upper 
convex  hull  of  //_*,  and  then  compute  fj  H  by  intersecting  [J  H ,  and  [J  If  . 

But  is  this  really  necessary?  If  lower  envelopes  correspond  to  upper  convex 
hulls,  and  upper  envelopes  correspond  to  lower  convex  hulls,  shouldn’t  then 
the  intersection  of  arbitrary  half-planes  correspond  to  full  convex  hulls?  In  a 
sense,  this  is  true.  The  problem  is  that  our  duality  transformation  cannot  handle 
vertical  lines,  and  lines  that  are  close  to  vertical  but  have  opposite  slope  are 
mapped  to  very  different  points.  This  explains  why  the  dual  of  the  convex  hull 
consists  of  two  parts  that  lie  rather  far  apart. 

It  is  possible  to  define  a  different  duality  transformation  that  allows  vertical 
lines.  However,  to  apply  this  duality  to  a  given  set  of  half-planes,  we  need  a 
point  in  the  intersection  of  the  half-planes.  But  that  was  to  be  expected.  As  long 
as  we  do  not  want  to  leave  the  Euclidean  plane,  there  cannot  be  any  general 
duality  that  turns  the  intersection  of  a  set  of  half-planes  into  a  convex  hull, 
because  the  intersection  of  half-planes  can  have  one  special  property:  it  can  be 
empty.  What  could  that  possibly  correspond  to  in  the  dual?  The  convex  hull 
of  a  set  of  points  in  Euclidean  space  is  always  well  defined:  there  is  no  such 
thing  as  “emptiness.”  (This  problem  is  nicely  solved  if  one  works  in  oriented 
projective  space,  but  this  concept  is  beyond  the  scope  of  this  book.)  Only  once 
you  know  that  the  intersection  is  not  empty,  and  a  point  in  the  interior  is  known, 
can  you  define  a  duality  that  relates  the  intersection  with  a  convex  hull. 

We  leave  it  at  this  for  now.  The  important  thing  is  that — although  there  are 
technical  complications — convex  hulls  and  intersections  of  half-planes  (or  half¬ 
spaces  in  three  dimensions)  are  essentially  dual  concepts.  Hence,  an  algorithm 
to  compute  the  intersection  of  half-planes  in  the  plane  (or  half-spaces  in  three 
dimensions)  can  be  given  by  dualizing  a  convex-hull  algorithm. 


11.5*  Voronoi  Diagrams  Revisited 

In  Chapter  7  we  introduced  the  Voronoi  diagram  of  a  set  of  points  in  the  plane.  It 
may  come  as  a  surprise  that  there  is  a  close  relationship  between  planar  Voronoi 


diagrams  and  the  intersection  of  upper  half-spaces  in  3-dimensional  space.  By  Section  11.5* 
the  result  on  duality  of  the  previous  section,  this  implies  a  close  relation  between  voronoi  diagrams  revisited 
planar  Voronoi  diagrams  and  lower  convex  hulls  in  3-space. 

This  has  to  do  with  an  amazing  property  of  the  unit  paraboloid  in  3-space. 

Let  11  :=  (z  =  x2  +  y2)  denote  the  unit  paraboloid,  and  let  p  :=  (px,py,0)  be 
a  point  in  the  plane  z  =  0.  Consider  the  vertical  line  through  p.  It  intersects 
ll  in  the  point  p'  :=  ( px . py . p2  +  p2).  Let  h{p)  be  the  non-vertical  plane  z  = 

2pxx  +  2pyy  —  (p2  +  p2).  Notice  that  h{p)  contains  the  point  p'.  Now  consider 
any  other  point  q  :=  (qx,qy,  0)  in  the  plane  z  =  0.  The  vertical  line  through  q 
intersects  ll  in  the  point  q'  :=  (qx,qy,q2  +  q2),  and  it  intersects  h(p )  in 

<l(p)  ■=  {qx,qy,2pxqx  +  2pyqy-  (px+py))- 

The  vertical  distance  between  q'  and  qip)  is 

q2x  +  q2  -  2 pxqx  -  2 pyqy  +  p2x  +  p2  =  (qx  -  px)2  +  (qy  -  py)2  =  dist (p,q)2. 

Hence,  the  plane  h(p)  encodes — together  with  the  unit  paraboloid — the  distance 
between  p  and  any  other  point  in  the  plane  z  =  0.  (Since  distf p.  q)2  f  0  for 
any  point  q ,  and  p'  £  h(p),  this  also  implies  that  h(p)  is  the  tangent  plane  to  ll 
at  p'.) 

The  fact  that  the  plane  h(p)  encodes  the  distance  of  other  points  to  p  leads  to 
a  correspondence  between  Voronoi  diagrams  and  upper  envelopes,  as  explained 
next.  Let  P  be  a  planar  point  set,  which  we  imagine  to  lie  in  the  plane  z  =  0  of 
3-dimensional  space.  Consider  the  set  H  :=  {h(p)  \  p  £  P}  of  planes,  and  let 
U£(H)  be  the  upper  envelope  of  the  planes  in  H.  We  claim  that  the  projection 
of  U£(H)  on  the  plane  z  =  0  is  the  Voronoi  diagram  of  P.  Figure  1 1.5  illustrates 
this  one  dimension  lower:  the  Voronoi  diagram  of  the  points  pj  on  the  line  y  =  0 
is  the  projection  of  the  upper  envelope  of  the  lines  h(pi). 

Theorem  11.8  LetP  be  a  set  of  points  in  3-dimensional  space,  all  lying  in  the 
plane  z  =  0.  Let  H  be  the  set  of  planes  h(p),  for  p  £  P,  defined  as  above.  Then 
the  projection  o file  (7/  j  on  the  plane  z  =  0  is  the  Voronoi  diagram  ofP. 

Proof.  To  prove  the  theorem,  we  will  show  that  the  Voronoi  cell  of  a  point 
p  £  P  is  exactly  the  projection  of  the  facet  of  U£(H)  that  lies  on  the  plane  h(p). 

Let  q  be  a  point  in  the  plane  z  =  0  lying  in  the  Voronoi  cell  of  p.  Hence,  we 
have  dist(g,/;>)  <  distf/.  r)  for  all  r  £  P  with  r  /  p.  We  have  to  prove  that  the 
vertical  line  through  q  intersects  U£(H)  at  a  point  lying  on  h(p).  Recall  that  for 
a  point  r  £  P,  the  plane  h(r)  is  intersected  by  the  vertical  line  through  q  at  the 
point  q{r)  :=  (qx,qy,q2  +  q2  —  dist(g,  r)2).  Of  all  points  in  P,  the  point  p  has 
the  smallest  distance  to  q,  so  q(p)  is  the  highest  intersection  point.  Hence,  the 
vertical  line  through  q  intersects  U£(//)  at  apoint  lying  on  h(p),  as  claimed.  0 

This  theorem  implies  that  we  can  compute  a  Voronoi  diagram  in  the  plane  by 
computing  the  upper  envelope  of  a  set  of  planes  in  3-space.  By  Exercise  11.10 
(see  also  the  previous  section),  the  upper  envelope  of  a  set  of  planes  in  3-space 
is  in  one-to-one  correspondence  to  the  lower  convex  hull  of  the  points  H* ,  so 
we  can  immediately  use  our  algorithm  ConvexHull. 
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Figure  11.5 

The  correspondence  between  Voronoi 
diagrams  and  upper  envelopes 


Not  surprisingly,  the  lower  convex  hull  of  H*  has  a  geometric  meaning  as 
well:  its  projection  on  the  plane  z  =  0  is  the  Delaunay  graph  of  P. 


11.6  Notes  and  Comments 

The  early  convex  hull  algorithms  worked  only  for  points  in  the  plane — see 
the  notes  and  comments  of  Chapter  1  for  a  discussion  of  these  algorithms. 
Computing  convex  hulls  in  3-dimensional  space  turns  out  to  be  considerably 
more  difficult.  One  of  the  first  algorithms  was  the  “gift  wrapping”  algorithm 
due  to  Chand  and  Kapur  [84].  It  finds  facet  after  facet  by  “rotating”  a  plane 
over  known  edges  of  the  hull  until  the  first  point  is  found.  The  running  time  is 
0(nf )  for  a  convex  hull  with  /  facets,  which  is  0(n2)  in  the  worst  case.  The 
first  algorithm  to  achieve  O(nlogn)  running  time  was  a  divide-and-conquer 
algorithm  by  Preparata  and  Hong  [322,  323].  Early  incremental  algorithms  run 
in  time  0{n2)  [223,  344].  The  randomized  version  presented  here  is  due  to 
Clarkson  and  Shor  [133].  The  version  we  presented  needs  0(n  logn)  space;  the 
original  paper  gives  a  simple  improvement  to  linear  space.  The  idea  of  a  conflict 
graph,  used  here  for  the  first  time  in  this  book,  also  comes  from  the  paper  of 
Clarkson  and  Shor.  Our  analysis,  however,  is  due  to  Mulmuley  [290]. 

In  this  chapter  we  have  concentrated  on  3-dimensional  space,  where  convex 
hulls  have  linear  complexity.  The  so-called  Upper  Bound  Theorem  states  that 
the  worst-case  combinatorial  complexity  of  the  convex  hull  of  n  points  in  d- 
dimensional  space — phrased  in  dual  space:  the  intersection  of  n  half-spaces — is 
&{n i'1'2.  J.  (We  proved  this  result  for  the  case  d  =  3,  using  Euler’s  relation.) 
The  algorithm  described  in  this  chapter  generalizes  to  higher  dimensions,  and  is 
optimal  in  the  worst  case:  its  expected  running  time  is  ©(n^/2 J )  Interestingly, 
the  best  known  deterministic  convex  hull  algorithm  for  odd-dimensional  spaces 

_  is  based  on  a  (quite  complicated)  derandomization  of  this  algorithm  [97].  Since 

256  the  convex  hull  in  dimensions  greater  than  three  can  have  non-linear  complexity. 


output-sensitive  algorithms  may  be  useful.  The  best  known  output-sensitive 
algorithm  for  computing  convex  hulls  in  W1  is  due  to  Chan  [82].  Its  running 
time  is  O(n\ogk  +  (n^)1-1/^/2^1)  log0^) n),  where  k  denotes  the  complexity 
of  the  convex  hull.  A  good  overview  of  the  many  results  on  convex-hull 
computations  is  given  in  the  survey  by  Seidel  [347].  Readers  who  want  to  know 
more  about  the  mathematical  aspects  of  polytopes  in  higher  dimensions  can 
consult  Griinbaum’s  book  [194],  which  is  a  classical  reference  for  polytope 
theory,  or  Ziegler’s  book  [399],  which  treats  the  combinatorial  aspects. 

In  Section  1 1.5  we  have  seen  that  the  Voronoi  diagram  of  a  planar  point  set  is 
the  projection  of  the  upper  envelope  of  a  certain  set  of  planes  in  3-dimensional 
space.  A  similar  statement  is  true  in  higher  dimensions:  the  Voronoi  diagram 
of  a  set  of  points  in  W1  is  the  projection  of  the  upper  envelope  of  a  certain  set 
of  hyperplanes  in  1 .  Not  all  sets  of  (hyper)planes  define  an  upper  envelope 
whose  projection  is  the  Voronoi  diagram  of  some  point  set.  Interestingly,  any 
upper  envelope  does  project  onto  a  so-called  power  diagram,  a  generalization  of 
the  Voronoi  diagram  where  the  sites  are  spheres  rather  than  points  [25]. 


11.7  Exercises 

11.1  In  Chapter  1  we  defined  the  convex  hull  of  a  set  P  of  points  as  the 
intersection  of  all  convex  sets  containing  the  points.  In  the  current 
chapter  we  saw  another  definition:  the  convex  hull  of  P  is  the  set  of  all 
convex  combinations  of  the  points  in  P.  Prove  that  these  two  definitions 
are  equivalent,  that  is,  prove  that  a  point  q  is  a  convex  combination  of 
points  in  P  if  and  only  if  q  lies  in  every  convex  set  containing  P. 

1 1.2  Prove  that  the  worst  case  running  time  of  algorithm  ConvexHull  is 
(9(n3),  and  that  there  are  sets  of  points  where  a  bad  choice  of  the  random 
permutation  makes  the  algorithm  actually  need  @(n3)  time. 

11.3  Describe  a  randomized  incremental  algorithm  to  compute  the  convex 
hull  of  n  points  in  the  plane.  Describe  how  to  deal  with  degeneracies. 
Analyze  the  expected  running  time  of  your  algorithm. 

1 1 .4  In  many  applications,  only  a  small  percentage  of  the  points  in  a  given  set 
P  of  ii  points  are  extreme.  In  such  a  case,  the  convex  hull  of  P  has  less 
than  n  vertices.  This  can  actually  make  our  algorithm  ConvexHull 
run  faster  than  @(n logn). 

Assume,  for  instance,  that  the  expected  number  of  extreme  points  in  a 
random  sample  of  P  of  size  r  is  <9(r“),  for  some  constant  a  <  1.  (This 
is  true  when  the  set  P  has  been  created  by  picking  points  uniformly  at 
random  in  a  ball.)  Prove  that  under  this  condition,  the  running  time  of 
the  algorithm  is  O(n). 

1 1.5  The  convex  hull  of  a  set  P  of  n  points  in  3-dimensional  space  can  also 
be  computed  by  ’’rotating”  a  plane  over  known  edges  of  the  convex 
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hull,  thereby  discovering  new  facets.  Give  a  detailed  description  of  an 
algorithm  using  this  approach,  and  analyze  its  running  time. 

1 1 .6  Describe  a  data  structure  that  allows  you  to  test  whether  a  query  point 
q  lies  inside  a  convex  polytope  with  n  vertices  in  R3.  (Hint:  Use  the 
results  from  Chapter  6.) 

1 1.7  Define  a  simple  polytope  to  be  a  region  in  3-space  that  is  topologically 
equivalent  to  a  ball  (but  not  necessarily  convex)  and  whose  boundary 
consists  of  planar  polygons.  Describe  how  to  test  in  O(n)  time  whether 
a  point  lies  inside  a  simple  polytope  with  n  vertices  in  3-dimensional 
space. 

11.8  Describe  a  randomized  incremental  algorithm  to  compute  the  intersec¬ 
tion  of  half-planes,  and  analyze  its  expected  running  time.  Your  algo¬ 
rithm  should  maintain  the  intersection  of  the  current  set  of  half-planes. 
To  figure  out  where  to  insert  a  new  half-plane,  maintain  a  conflict  graph 
between  the  vertices  of  the  current  intersection  and  the  half-planes  that 
are  still  to  be  inserted. 

1 1 .9  Describe  a  randomized  incremental  algorithm  to  compute  the  intersec¬ 
tion  of  half-spaces  in  3 -dimensional  space,  and  analyze  its  expected 
running  time.  Maintain  a  conflict  graph  analogous  to  the  previous  exer¬ 
cise. 

11.10  In  this  exercise  you  have  to  work  out  the  details  of  a  3-dimensional 
duality  transformation.  Given  a  point  p  :=  (pXiPy,Pz)  in  R3*  let  P  be 
the  plane  z  =  pxx  +  pyy  p--  For  a  non- vertical  plane  h,  define  h*  such 
that  (li  f  =  h.  Give  a  definition  of  the  upper  convex  hull  ll‘Jt(P)  of 
a  set  of  points  P  and  the  lower  envelope  ££(//)  of  a  set  H  of  planes 
in  3-space,  similar  to  the  way  they  were  defined  for  the  planar  case  in 
Section  11.4. 

Show  the  following  properties. 

■  A  point  p  lies  on  a  plane  h  if  and  only  if  h*  lies  on  p* . 
m  A  point  p  lies  above  h  if  and  only  if  h*  lies  above  p* . 

m  A  point  p  £  P  is  a  vertex  of  1 iflf(P)  if  and  only  if  p*  appears  on 

££(/>*). 

■  A  segment  ~pq  is  an  edge  of  U0{ (P)  if  and  only  if  p*  and  q*  share  an 
edge  on  ££(.P*). 

■  Points  p\,p2,...,Pk  are  the  vertices  of  a  facet  /  of  Ur.K(P)  if  and 
only  if  pi*,p2*,.  ■  ■  ,Pk*  support  facets  of  Z £ ( P* )  that  share  a  com¬ 
mon  vertex. 


258 


12  Binary  Space  Partitions 

The  Painter’s  Algorithm 


These  days  pilots  no  longer  have  their  first  flying  experience  in  the  air,  but  on 
the  ground  in  a  flight  simulator.  This  is  cheaper  for  the  air  company,  safer  for 
the  pilot,  and  better  for  the  environment.  Only  after  spending  many  hours  in  the 
simulator  are  pilots  allowed  to  operate  the  control  stick  of  a  real  airplane.  Flight 
simulators  must  perform  many  different  tasks  to  make  the  pilot  forget  that  she 
is  sitting  in  a  simulator.  An  important  task  is  visualization:  pilots  must  be  able 
to  see  the  landscape  above  which  they  are  flying,  or  the  runway  on  which  they 
are  landing.  This  involves  both  modeling  landscapes  and  rendering  the  models. 
To  render  a  scene  we  must  determine  for  each  pixel  on  the  screen  the  object 
that  is  visible  at  that  pixel;  this  is  called  hidden  surface  removal.  We  must 
also  perform  shading  calculations,  that  is,  we  must  compute  the  intensity  of  the 
light  that  the  visible  object  emits  in  the  direction  of  the  view  point.  The  latter 
task  is  very  time-consuming  if  highly  realistic  images  are  desired:  we  must 
compute  how  much  light  reaches  the  object — either  directly  from  light  sources 
or  indirectly  via  reflections  on  other  objects — and  consider  the  interaction  of 
the  light  with  the  surface  of  the  object  to  see  how  much  of  it  is  reflected  in  the 
direction  of  the  view  point.  In  flight  simulators  rendering  must  be  performed  in 
real-time,  so  there  is  no  time  for  accurate  shading  calculations.  Therefore  a  fast 
and  simple  shading  technique  is  employed  and  hidden  surface  removal  becomes 
an  important  factor  in  the  rendering  time. 

The  z-buffer  algorithm  is  a  very  simple  method  for  hidden  surface  removal. 
This  method  works  as  follows.  First,  the  scene  is  transformed  such  that  the 
viewing  direction  is  the  positive  "-direction.  Then  the  objects  in  the  scene 
are  scan-converted  in  arbitrary  order.  Scan-converting  an  object  amounts  to 
determining  which  pixels  it  covers  in  the  projection;  these  are  the  pixels  where 
the  object  is  potentially  visible.  The  algorithm  maintains  information  about  the 
already  processed  objects  in  two  buffers:  a  frame  buffer  and  a  z-buffer.  The 
frame  buffer  stores  for  each  pixel  the  intensity  of  the  currently  visible  object, 
that  is,  the  object  that  is  visible  among  those  already  processed.  The  z-buffer 
stores  for  each  pixel  the  z-coordinate  of  the  currently  visible  object.  (More 
precisely,  it  stores  the  z-coordinate  of  the  point  on  the  object  that  is  visible  at 
the  pixel.)  Now  suppose  that  we  select  a  pixel  when  scan-converting  an  object. 
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Figure  12.1 
The  painter’s  algorithm  in  action 
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If  the  ’-coordinate  of  the  object  at  that  pixel  is  smaller  than  the  7-coordinate 
stored  in  the  ’-buffer,  then  the  new  object  lies  in  front  of  the  currently  visible 
object.  So  we  write  the  intensity  of  the  new  object  to  the  frame  buffer,  and  its 
Z-coordinate  to  the  z-buffer.  If  the  z-coordinate  of  the  object  at  that  pixel  is  larger 
than  the  z-coordinate  stored  in  the  z-buffer,  then  the  new  object  is  not  visible, 
and  the  frame  buffer  and  z-buffer  remain  unchanged.  The  z-buffer  algorithm  is 
easily  implemented  in  hardware  and  quite  fast  in  practice.  Hence,  this  is  the 
most  popular  hidden  surface  removal  method.  Nevertheless,  the  algorithm  has 


some  disadvantages:  a  large  amount  of  extra  storage  is  needed  for  the  z-buffer, 
and  an  extra  test  on  the  z-coordinate  is  required  for  every  pixel  covered  by  an 
object.  The  painter’s  algorithm  avoids  these  extra  costs  by  first  sorting  the 
objects  according  to  their  distance  to  the  view  point.  Then  the  objects  are  scan- 
converted  in  this  so-called  depth  order ,  starting  with  the  object  farthest  from 
the  view  point.  When  an  object  is  scan-converted  we  do  not  need  to  perform 
any  test  on  its  z-coordinate,  we  always  write  its  intensity  to  the  frame  buffer. 
Entries  in  the  frame  buffer  that  have  been  filled  before  are  simply  overwritten. 
Figure  12.1  illustrates  the  algorithm  on  a  scene  consisting  of  three  triangles. 
On  the  left,  the  triangles  are  shown  with  numbers  corresponding  to  the  order 
in  which  they  are  scan-converted.  The  images  after  the  first,  second,  and  third 
triangle  have  been  scan-converted  are  shown  as  well.  This  approach  is  correct 
because  we  scan-convert  the  objects  in  back-to-front  order:  for  each  pixel  the 
last  object  written  to  the  corresponding  entry  in  the  frame  buffer  will  be  the  one 
closest  to  the  viewpoint,  resulting  in  a  correct  view  of  the  scene.  The  process 
resembles  the  way  painters  work  when  they  put  layers  of  paint  on  top  of  each 
other,  hence  the  name  of  the  algorithm. 

To  apply  this  method  successfully  we  must  be  able  to  sort  the  objects  quickly. 
Unfortunately  this  is  not  so  easy.  Even  worse,  a  depth  order  may  not  always 
exist:  the  in-front-of  relation  among  the  objects  can  contain  cycles.  When  such 
a  cyclic  overlap  occurs,  no  ordering  will  produce  a  correct  view  of  this  scene.  In 
this  case  we  must  break  the  cycles  by  splitting  one  or  more  of  the  objects,  and 
hope  that  a  depth  order  exists  for  the  pieces  that  result  from  the  splitting.  When 
there  is  a  cycle  of  three  triangles,  for  instance,  we  can  always  split  one  of  them 
into  a  triangular  piece  and  a  quadrilateral  piece,  such  that  a  correct  displaying 
order  exists  for  the  resulting  set  of  four  objects.  Computing  which  objects  to 
split,  where  to  split  them,  and  then  sorting  the  object  fragments  is  an  expensive 
process.  Because  the  order  depends  on  the  position  of  the  view  point,  we  must 
recompute  the  order  every  time  the  view  point  moves.  If  we  want  to  use  the 
painter’s  algorithm  in  a  real-time  environment  such  as  flight  simulation,  we 
should  preprocess  the  scene  such  that  a  correct  displaying  order  can  be  found 


quickly  for  any  view  point.  An  elegant  data  structure  that  makes  this  possible  is  Section  12.1 

the  binary  space  partition  tree,  or  BSP  tree  for  short.  the  definition  of  bsp  trees 


12.1  The  Definition  of  BSP  Trees 

To  get  a  feeling  for  what  a  BSP  tree  is,  take  a  look  at  Figure  12.2.  This  figure 
shows  a  binary  space  partition  (BSP)  for  a  set  of  objects  in  the  plane,  together 
with  the  tree  that  corresponds  to  the  BSP.  As  you  can  see,  the  binary  space 
partition  is  obtained  by  recursively  splitting  the  plane  with  a  line:  first  we  split 
the  entire  plane  with  /:) ,  then  we  split  the  half-plane  above  l\  with  h  and  the 
half-plane  below  l\  with  l2,  and  so  on.  The  splitting  lines  not  only  partition  the 
plane,  they  may  also  cut  objects  into  fragments.  The  splitting  continues  until 
there  is  only  one  fragment  left  in  the  interior  of  each  region.  This  process  is 


naturally  modeled  as  a  binary  tree.  Each  leaf  of  this  tree  corresponds  to  a  face 
of  the  final  subdivision;  the  object  fragment  that  lies  in  the  face  is  stored  at  the 
leaf.  Each  internal  node  corresponds  to  a  splitting  line;  this  line  is  stored  at  the 
node.  When  there  are  1 -dimensional  objects  (line  segments)  in  the  scene  then 
objects  could  be  contained  in  a  splitting  line;  in  that  case  the  corresponding 
internal  node  stores  these  objects  in  a  list. 


For  a  hyperplane  h  :  a\X\  +  <22X2  H - 1-  Q-dX-d  +  ad+ 1  =  0,  we  let  h+  be  the  open 

positive  half-space  bounded  by  h  and  we  let  hr  be  the  open  negative  half-space: 


h+  :={(xi,X2,--.,xd)  :aixi+a2X2-\ - \-adxd  +  ad+i  >  0} 


and 


h  ■=  {(xi,x2,...,xd)  :  aixl+a2X2~\ - \-adxd+ad+ 1  <  0}. 

A  binary  space  partition  tree,  or  BSP  tree,  for  a  set  S  of  objects  in  ^-dimensional 
space  is  now  defined  as  a  binary  tree  T  with  the  following  properties: 

■  If  card  (5)  ^  1  then  T  is  a  leaf;  the  object  fragment  in  S  (if  it  exists)  is  stored 

explicitly  at  this  leaf.  If  the  leaf  is  denoted  by  v,  then  the  (possibly  empty)  _ 

set  stored  at  the  leaf  is  denoted  by  S(v).  261 
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Figure  12.3 

The  correspondence  between  nodes  and 

regions 
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■  If  card(S)  >  1  then  the  root  v  of  T  stores  a  hyperplane  hv,  together  with  the 
set  S(v)  of  objects  that  are  fully  contained  in  hv.  The  left  child  of  v  is  the 
root  of  a  BSP  tree  T-  for  the  set  S~  :=  {hy  fls  :  s  €  5},  and  the  right  child 
of  v  is  the  root  of  a  BSP  tree  T+  for  the  set  S+  :=  {/;+  n  s  :  s  G  S}. 

The  size  of  a  BSP  tree  is  the  total  size  of  the  sets  .S’ f  v )  over  all  nodes  v  of  the 
BSP  tree.  In  other  words,  the  size  of  a  BSP  tree  is  the  total  number  of  object 
fragments  that  are  generated.  If  the  BSP  does  not  contain  useless  splitting 
lines — lines  that  split  off  an  empty  subspace — then  the  number  of  nodes  of  the 
tree  is  at  most  linear  in  the  size  of  the  BSP  tree.  Strictly  speaking,  the  size 
of  the  BSP  tree  does  not  say  anything  about  the  amount  of  storage  needed  to 
store  it,  because  it  says  nothing  about  the  amount  of  storage  needed  for  a  single 
fragment.  Nevertheless,  the  size  of  a  BSP  tree  as  we  defined  it  is  a  good  measure 
to  compare  the  quality  of  different  BSP  trees  for  a  given  set  of  objects. 


The  leaves  in  a  BSP  tree  represent  the  faces  in  the  subdivision  that  the  BSP 
induces.  More  generally,  we  can  identify  a  convex  region  with  each  node  v  in  a 
BSP  tree  T:  this  region  is  the  intersection  of  the  half-spaces  ,  where  fi  is  an 
ancestor  of  v  and  o  =  —  when  v  is  in  the  left  subtree  of  jl,  and  o  =  +  when  it 
is  in  the  right  subtree.  The  region  corresponding  to  the  root  of  T  is  the  whole 
space.  Figure  12.3  illustrates  this:  the  grey  node  corresponds  to  the  grey  region 

£^r\£}n£^. 


The  splitting  hyperplanes  used  in  a  BSP  can  be  arbitrary.  For  computational 
purposes,  however,  it  can  be  convenient  to  restrict  the  set  of  allowable  splitting 
hyperplanes.  A  usual  restriction  is  the  following.  Suppose  we  want  to  construct 
a  BSP  for  a  set  of  line  segments  in  the  plane.  An  obvious  set  of  candidates  for 
the  splitting  lines  is  the  set  of  extensions  of  the  input  segments.  A  BSP  that  only 
uses  such  splitting  lines  is  called  an  auto-partition.  For  a  set  of  planar  polygons 
in  3-space,  an  auto-partition  is  a  BSP  that  only  uses  planes  through  the  input 
polygons  as  splitting  planes.  It  seems  that  the  restriction  to  auto-partitions  is  a 
severe  one.  But,  although  auto-partitions  cannot  always  produce  minimum-size 
BSP  trees,  we  shall  see  that  they  can  produce  reasonably  small  ones. 


12.2  BSP  Trees  and  the  Painter’s  Algorithm 


Suppose  we  have  built  a  BSP  tree  Ton  a  set  S  of  objects  in  3-dimensional 
space.  How  can  we  use  T  to  get  the  depth  order  we  need  to  display  the  set  S 
with  the  painter’s  algorithm?  Let  pv ;ew  be  the  view  point  and  suppose  that  pview 
lies  above  the  splitting  plane  stored  at  the  root  of  T.  Then  clearly  none  of  the 
objects  below  the  splitting  plane  can  obscure  any  of  the  objects  above  it.  Hence, 
we  can  safely  display  all  the  objects  (more  precisely,  object  fragments)  in  the 
subtree  T  before  displaying  those  in  T+.  The  order  for  the  object  fragments 
in  the  two  subtrees  T+  and  is  obtained  recursively  in  the  same  way.  This  is 
summarized  in  the  following  algorithm. 


Algorithm  PAINTERS  ALGORITHM(T,pview) 

1 .  Let  v  be  the  root  of  T. 

2.  if  v  is  a  leaf 

3.  then  Scan-convert  the  object  fragments  in  S(v). 

4.  else  if  pview  €  h+ 

5.  then  Painters ALGORiTHM(T~,pview) 

6.  Scan-convert  the  object  fragments  in  S(v). 

7.  Painters  ALGORiTHM(T+,pview) 

8.  else  if  p v i c w  £  (ty 

9.  then  Painters ALGORiTHM(T+,pview) 

10.  Scan-convert  the  object  fragments  in  ,S'(  V). 

11.  Painters  Algorithm^,  pView) 

12.  else  (^  Pview  ^  hv  ^0 

13.  Painters  ALGORiTHM(T+,pView) 

14.  Painters  Algorithm^,  pview) 


Note  that  we  do  not  draw  the  polygons  in  S(v)  when  pview  lies  on  the  splitting 
plane  hv,  because  polygons  are  flat  2-dimensional  objects  and  therefore  not 
visible  from  points  that  lie  in  the  plane  containing  them. 


The  efficiency  of  this  algorithm — indeed,  of  any  algorithm  that  uses  BSP  trees — 
depends  largely  on  the  size  of  the  BSP  tree.  So  we  must  choose  the  splitting 
planes  in  such  a  way  that  fragmentation  of  the  objects  is  kept  to  a  minimum. 
Before  we  can  develop  splitting  strategies  that  produce  small  BSP  trees,  we 
must  decide  on  which  types  of  objects  we  allow.  We  became  interested  in  BSP 
trees  because  we  needed  a  fast  way  of  doing  hidden  surface  removal  for  flight 
simulators.  Because  speed  is  our  main  concern,  we  should  keep  the  type  of 
objects  in  the  scene  simple:  we  should  not  use  curved  surfaces,  but  represent 
everything  in  a  polyhedral  model.  We  assume  that  the  facets  of  the  polyhedra 
have  been  triangulated.  So  we  want  to  construct  a  BSP  tree  of  small  size  for  a 
given  set  of  triangles  in  3-dimensional  space. 


Section  12.2 

BSP  TREES  AND  THE  PAINTER’S 
ALGORITHM 


hy 
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When  you  want  to  solve  a  3 -dimensional  problem,  it  is  usually  not  a  bad  idea  to 
gain  some  insight  by  first  studying  the  planar  version  of  the  problem.  This  is 
also  what  we  do  in  this  section. 

Let  S  be  a  set  of  n  non-intersecting  line  segments  in  the  plane.  We  will  restrict 
our  attention  to  auto-partitions,  that  is,  we  only  consider  lines  containing  one  of 
the  segments  in  S  as  candidate  splitting  lines.  The  following  recursive  algorithm 
for  constructing  a  BSP  immediately  suggests  itself.  Let  £(s)  denote  the  line  that 
contains  a  segment  s. 

Algorithm  2  D  B  S  P(.S’) 

Input.  A  set  S  =  {si,S2i  •  •  •  ,sn}  of  segments. 

Output.  A  BSP  tree  for  S. 

1.  ifcard(S)  ^  1 

2.  then  Create  a  tree  7  consisting  of  a  single  leaf  node,  where  the  set  S  is 
stored  explicitly. 

3.  return  7 

4.  else  (*  Use  £(s i)  as  the  splitting  line.  *) 

5.  S+^{sn£(>i)+:seS};  7+  <-2dBsp(S+) 

6.  S~  <-  {sn^(si)-  :seS};  T~  <-2dBsp(S-) 

7.  Create  a  BSP  tree  T  with  root  node  v,  left  subtree  7~,  right  sub¬ 
tree  T+,  and  with  S(v)  =  {s  €  S  :  s  C  f(si)}. 

8.  return  7 

The  algorithm  clearly  constructs  a  BSP  tree  for  the  set  S.  But  is  it  a  small  one? 
Perhaps  we  should  spend  a  little  more  effort  in  choosing  the  right  segment  to  do 
the  splitting,  instead  of  blindly  taking  the  first  segment,  ,v  i .  One  approach  that 
comes  to  mind  is  to  take  the  segment  s  £  S  such  that  £(s)  cuts  as  few  segments 
as  possible.  But  this  is  too  greedy:  there  are  configurations  of  segments  where 
this  approach  doesn’t  work  well.  Furthermore,  finding  this  segment  would  be 
time  consuming.  What  else  can  we  do?  Perhaps  you  already  guessed:  as  in 
previous  chapters  where  we  had  to  make  a  difficult  choice,  we  simply  make  a 
random  choice.  That  is  to  say,  we  use  a  random  segment  to  do  the  splitting.  As 
we  shall  see  later,  the  resulting  BSP  is  expected  to  be  fairly  small. 

To  implement  this,  we  put  the  segments  in  random  order  before  we  start  the 
construction: 

Algorithm  2dRandomBsp(S) 

1 .  Generate  a  random  permutation  S'  =  si , . . . ,  sn  of  the  set  S. 

2.  T  4— 2dBsp(5') 

3.  return  T 

Before  we  analyze  this  randomized  algorithm,  we  note  that  one  simple  opti¬ 
mization  is  possible.  Suppose  that  we  have  chosen  the  first  few  partition  lines. 
These  lines  induce  a  subdivision  of  the  plane  whose  faces  correspond  to  nodes 
in  the  BSP  tree  that  we  are  constructing.  Consider  one  such  face  /.  There  can 
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be  segments  that  cross  /  completely.  Selecting  one  of  these  crossing  segments 
to  split  /  will  not  cause  any  fragmentation  of  other  segments  inside  /,  while  the 
segment  itself  can  be  excluded  from  further  consideration.  It  would  be  foolish 
not  to  take  advantage  of  such  free  splits.  So  our  improved  strategy  is  to  make 
free  splits  whenever  possible,  and  to  use  random  splits  otherwise.  To  implement 
this  optimization,  we  must  be  able  to  tell  whether  a  segment  is  a  free  split.  To 
this  end  we  maintain  two  boolean  variables  with  each  segment,  which  indicate 
whether  the  left  and  right  endpoint  lie  on  one  of  the  already  added  splitting  lines. 
When  both  variables  become  true,  then  the  segment  is  a  free  split. 

We  now  analyze  the  performance  of  algorithm  2dRandomBsp.  To  keep  it 
simple,  we  will  analyze  the  version  without  free  splits.  (In  fact,  free  splits  do 
not  make  a  difference  asympotically.) 

We  start  by  analyzing  the  size  of  the  BSP  tree  or,  in  other  words,  the  number 
of  fragments  that  are  generated.  Of  course,  this  number  depends  heavily  on 
the  particular  permutation  generated  in  line  1 :  some  permutations  may  give 
small  BSP  trees,  while  others  give  very  large  ones.  As  an  example,  consider  the 
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Figure  12.4 

Different  orders  give  different  BSPs 


collection  of  three  segments  depicted  in  Figure  12.4.  If  the  segments  are  treated 
as  illustrated  in  part  (a)  of  the  figure,  then  five  fragments  result.  A  different 
order,  however,  gives  only  three  fragments,  as  shown  in  part  (b).  Because  the 
size  of  the  BSP  varies  with  the  permutation  that  is  used,  we  will  analyze  the 
expected  size  of  the  BSP  tree,  that  is,  the  average  size  over  all  n!  permutations. 

Lemma  12.1  The  expected  number  of  fragments  generated  by  the  algorithm 
2dRandomBsp  is  0(n\ogn). 

Proof.  Let  s,  be  a  fixed  segment  in  S.  We  shall  analyze  the  expected  number 
of  other  segments  that  are  cut  when  £{sf)  is  added  by  the  algorithm  as  the  next 
splitting  line. 

In  Figure  12.4  we  can  see  that  whether  or  not  a  segment  Sj  is  cut  when  £(si) 
is  added — assuming  it  can  be  cut  at  all  by  £(sj) — depends  on  segments  that  are 
also  cut  by  £(s,)  and  are  ‘in  between’ s,-  and  Sj.  In  particular,  when  the  line 
through  such  a  segment  is  used  before  then  it  shields  S;  from  Sj.  This  is 
what  happened  in  Figure  12.4(b):  the  segment  ,st  shielded  ,v  ;  from  sj.  These 
considerations  lead  us  to  define  the  distance  of  a  segment  with  respect  to  the 
fixed  segment  sg 

(  the  number  of  segments  intersecting  if  £(sj)  intersects  Sj 
distjTs.-)  =  <  £(sj)  in  between  s,-  and  Sj 


otherwise 
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Chapter  12  F°r  any  finite  distance,  there  are  at  most  two  segments  at  that  distance,  one  on 
BINARY  SPACE  PARTITIONS  either  side  of  Sj. 

Let  k  :=  distSi (sj),  and  let  Sjl,Sj2,...,Sj.  be  the  segments  in  between  s,- 
and  sj.  What  is  the  probability  that  £(si)  cuts  Sj  when  added  as  a  splitting  line? 
For  this  to  happen,  ,y,  must  come  before  sj  in  the  random  ordering  and,  moreover, 
it  must  come  before  any  of  the  segments  in  between  s,  and  Sj,  which  shield 
sj  from  Sj.  In  other  words,  of  the  set  {/././] , . . . ,  j/,}  of  indices,  i  must  be  the 
smallest  one.  Because  the  order  of  the  segments  is  random,  this  implies 


Pr[f(s;)  CUtS  Sj]  < 


1 

distSi.(s;)+2' 


Notice  that  there  can  be  segments  that  are  not  cut  by  £(s,)  but  whose  extension 
shields  Sj.  This  explains  why  the  expression  above  is  not  an  equality. 

We  can  now  bound  the  expected  total  number  of  cuts  generated  by  s,: 


E  [number  of  cuts  generated  by  s;] 


E 

jf=i 


1 

distVj.  (s  j)  +  2 


n—2 


2E 

k= 0 


1 

k  +  2 


2  In  n. 


By  linearity  of  expectation,  we  can  conclude  that  the  expected  total  number  of 
cuts  generated  by  all  segments  is  at  most  2nlnn.  Since  we  start  with  n  segments, 
the  expected  total  number  of  fragments  is  bounded  by  n  +  2 n  In n.  0 


We  have  shown  that  the  expected  size  of  the  BSP  that  is  generated  by 
2dRandomBsp  is  n  +  2nlnn.  As  a  consequence,  we  have  proven  that  a  BSP 
of  size  n  +  2n\nn  exists  for  any  set  of  n  segments.  Furthermore,  at  least  half  of 
all  permutations  lead  to  a  BSP  of  size  n  +  4n\nn.  We  can  use  this  to  find  a  BSP 
of  that  size:  After  running  2dRandomBsp  we  test  the  size  of  the  tree,  and  if  it 
exceeds  that  bound,  we  simply  start  the  algorithm  again  with  a  fresh  random 
permutation.  The  expected  number  of  trials  is  two. 


We  have  analyzed  the  size  of  the  BSP  that  is  produced  by  2dRandomBSP. 
What  about  the  running  time?  Again,  this  depends  on  the  random  permutation 
that  is  used,  so  we  look  at  the  expected  running  time.  Computing  the  random 
permutation  takes  linear  time.  If  we  ignore  the  time  for  the  recursive  calls, 
then  the  time  taken  by  algorithm  2dBsp  is  linear  in  the  number  of  fragments 
in  S.  This  number  is  never  larger  than  n — in  fact,  it  gets  smaller  with  each 
recursive  call.  Finally,  the  number  of  recursive  calls  is  obviously  bounded  by 
the  total  number  of  generated  fragments,  which  is  O(nlogn).  Hence,  the  total 
construction  time  is  0{n2\ogn),  and  we  get  the  following  result. 

Theorem  12.2  A  BSP  of  size  0(n  log/;)  can  be  computed  in  expected  time 
0(n2logn). 


Although  the  expected  size  of  the  BSP  that  is  constructed  by  2dRan- 
DOMBsp  is  fairly  good,  the  running  time  of  the  algorithm  is  somewhat  disap¬ 
pointing.  In  many  applications  this  is  not  so  important,  because  the  construction 
is  done  off-line.  Moreover,  the  construction  time  is  only  quadratic  when  the  BSP 
is  very  unbalanced,  which  is  rather  unlikely  to  occur  in  practice.  Nevertheless, 
from  a  theoretical  point  of  view  the  construction  time  is  disappointing.  Using  an 
approach  based  on  segment  trees — see  Chapter  10 — this  can  be  improved:  one 
can  construct  a  BSP  of  size  O(nlogn)  in  O(nlogn)  time  with  a  deterministic 
algorithm.  This  approach  does  not  give  an  auto-partition,  however,  and  in 
practice  it  produces  BSPs  that  are  slightly  larger. 

A  natural  question  is  whether  the  size  of  the  BSP  generated  by  2dRan- 
DOMBsp  can  be  improved.  Would  it  for  example  be  possible  te  devise  an 
algorithm  that  produces  a  BSP  of  size  0(n)  for  any  set  of  11  disjoint  segments 
in  the  plane?  The  answer  is  no:  there  are  sets  of  segments  for  which  any  BSP 
must  have  size  f2(«logn/loglogn).  Note  that  the  algorithm  we  presented  does 
not  achieve  this  bound,  so  there  may  still  be  room  for  a  slight  improvement. 


The  algorithm  we  described  for  the  planar  case  generalizes  to  3-dimensional 
space.  Let  S  be  a  set  of  n  non-intersecting  triangles  in  R3.  Again  we  restrict 
ourselves  to  auto-partitions,  that  is,  we  only  use  partition  planes  containing  a 
triangle  of  S.  For  a  triangle  t  we  denote  the  plane  containing  it  by  hit). 


Algorithm  3 1)  B  S  P(.S’) 

Input.  A  set  S  =  ■  ■  ■  ,t„}  of  triangles  in  M3. 

Output.  A  BSP  tree  for  S. 

1.  ifcard(S)^l 

2.  then  Create  a  tree  7  consisting  of  a  single  leaf  node,  where  the  set  S  is 

stored  explicitly. 

3.  return  7 

4.  else  (*  Use  h(t\)  as  the  splitting  plane.  *) 

5.  S+^-{fn/!(fi)+:f<ES};  T+  ^3dBsp(S+) 

6.  S~  <^{tnh(h)-  ^3dBsp(S~) 

7.  Create  a  BSP  tree  7  with  root  node  v,  left  subtree  7  ,  right 
subtree  T+,  and  with  S(v)  =  {t  €  S  :  t  C  /z(fi)}. 

8.  return  7 


The  size  of  the  resulting  BSP  again  depends  on  the  order  of  the  triangles;  some 
orders  give  more  fragments  than  others.  As  in  the  planar  case,  we  can  try  to 
get  a  good  expected  size  by  first  putting  the  triangles  in  a  random  order.  This 
usually  gives  a  good  result  in  practice.  However,  it  is  not  known  how  to  analyze 
the  expected  behavior  of  this  algorithm  theoretically.  Therefore  we  will  analyze 
a  variant  of  the  algorithm  in  the  next  section,  although  the  algorithm  described 
above  is  probably  superior  in  practice. 
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12.4*  The  Size  of  BSP  Trees  in  3-Space 

The  randomized  algorithm  for  constructing  a  BSP  tree  in  3-space  that  we  analyze 
in  this  section  is  almost  the  same  as  the  improved  algorithm  described  above:  it 
treats  the  triangles  in  random  order,  and  it  makes  free  splits  whenever  possible. 
A  free  split  now  occurs  when  a  triangle  of  S  splits  a  cell  into  two  disconnected 
subcells.  The  only  difference  is  that  when  we  use  some  plane  h(t)  as  a  splitting 
plane,  we  use  it  in  all  cells  intersected  by  that  plane,  not  just  in  the  cells  that 
are  intersected  by  t.  (And  therefore  a  simple  recursive  implementation  is  no 
longer  possible.)  There  is  one  exception  to  the  rule  that  we  split  all  cells  with 
h(t):  when  the  split  is  completely  useless  for  a  cell,  because  all  the  triangles  in 
that  cell  lie  completely  to  one  side  of  it,  then  we  do  not  split  it. 

Figure  12.5  illustrates  this  on  a  2-dimensional  example.  In  part  (a)  of 
the  figure,  the  subdivision  is  shown  that  is  generated  by  the  algorithm  of  the 
previous  section  after  treating  segments  si,  S2,  and  S3  (in  that  order).  In  part  (b) 
the  subdivision  is  shown  as  generated  by  the  modified  algorithm.  Note  that  the 
modified  algorithm  uses  f(s2)  as  a  splitting  line  in  the  subspace  below  £(si), 
and  that  £($3)  is  used  as  a  splitting  line  in  the  subspace  to  the  right  of  i(s2).  The 
line  l(si)  is  not  used  in  the  subspace  between  f?(si)  and  (.{sf),  however,  because 
it  is  useless  there. 


Figure  12.5 

The  original  and  the  modified  algorithm 


The  modified  algorithm  can  be  summarized  as  follows.  Working  out  the 
details  is  left  as  an  exercise. 

Algorithm  3dRandomBsp2(S) 

Input.  A  set  S  =  {t  1  ,?2, . . .  ,tn}  of  triangles  in  M3. 

Output.  A  BSP  tree  for  S. 

1 .  Generate  a  random  permutation  t\ , . . . ,  tn  of  the  set  S. 

2.  for  i  <—  1  to  n 

3.  do  Use  h(ti)  to  split  every  cell  where  the  split  is  useful. 

4.  Make  all  possible  free  splits. 

The  next  lemma  analyzes  the  expected  number  of  fragments  generated  by  the 
algorithm. 

Lemma  12.3  The  expected  number  of  object  fragments  generated  by  algorithm 
3dRandomBsp2  over  all  n\  possible  permutations  is  0(n2). 
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Proof.  We  shall  prove  a  bound  on  the  expected  number  of  fragments  into 
which  a  fixed  triangle  £  S  is  cut.  For  a  triangle  t,  with  i  <  k  we  define 


£i  :=  h(ti )  fl/i(4).  The  set  L\=  1 }  is  a  set  of  at  most  k—  1  lines 

lying  in  the  plane  /i(4).  Some  of  these  lines  intersect  4,  others  miss  4.  For 
a  line  /,  that  intersects  4  we  define  4  :=  £,  ft  4- .  Let  /  be  the  set  of  all  such 
intersections  4.  Due  to  free  splits  the  number  of  fragments  into  which  4  is  cut 
is  in  general  not  simply  the  number  of  faces  in  the  arrangement  that  I  induces 
on  4.  To  understand  this,  consider  the  moment  that  4_i  is  treated.  Assume  that 
£k~\  intersects  4;  otherwise  4_i  does  not  cause  any  fragmentation  on  4.  The 
segment  1  can  intersect  several  of  the  faces  of  the  arrangement  on  4  induced 
by  i'\  { .sy, } .  If,  however,  such  a  face  /  is  not  incident  to  the  one  of  the  edges  of 
4 — we  call  /  an  interior  face — then  a  free  split  already  has  been  made  through 
this  part  of  4.  In  other  words,  /t(4_i)  only  causes  cuts  in  exterior  faces,  that 
is,  faces  that  are  incident  to  one  of  the  three  edges  of  4.  Hence,  the  number  of 
splits  on  4  caused  by  /z(4_  1 )  equals  the  number  of  edges  that  contributes 
to  exterior  faces  of  the  arrangement  on  4  induced  by  I.  (In  the  analysis  that 
follows,  it  is  important  that  the  collection  of  exterior  faces  is  independent  of 
the  order  in  which  4, . . . ,  4— 1  have  been  treated.  This  is  not  the  case  for  the 
algorithm  in  the  previous  section,  which  is  the  reason  for  the  modification.) 
What  is  the  expected  number  of  such  edges?  To  answer  this  question  we  first 
bound  the  total  number  of  edges  of  the  exterior  faces. 

In  Chapter  8  we  defined  the  zone  of  a  line  t  in  an  arrangement  of  lines  in 
the  plane  as  the  set  of  faces  of  the  arrangement  intersected  by  l.  You  may  recall 
that  for  an  arrangement  of  m  lines  the  complexity  of  the  zone  is  0(m).  Now  let 
e\,  and  ej,  be  the  edges  of  4  and  let  £{ei)  be  the  line  through  e for  i  =  1,2,3. 
The  edges  that  we  are  interested  in  must  be  in  the  zone  of  either  £{e  1),  £(ej), 
or  £(e{)  in  the  arrangement  induced  by  the  set  L  on  the  plane  /z(4).  Hence,  the 
total  number  of  edges  of  exterior  faces  is  O(k). 

If  the  total  number  of  edges  of  exterior  faces  is  0(k),  then  the  average 
number  of  edges  lying  on  a  segment  ,v,  is  0(1).  Because  4 ....  ,4  is  a  random 
permutation,  so  is  4,.. . ,  4—  1  -  Hence,  the  expected  number  of  edges  on  segment 
Sk- 1  is  constant,  and  therefore  the  expected  number  of  extra  fragments  on  4 
caused  by  /i(4-i)  is  0(1).  The  same  argument  shows  that  the  expected  number 
of  fragmentations  on  4  generated  by  each  of  the  splitting  planes  h(t\)  through 
h(tk~ 2)  is  constant.  This  implies  that  the  expected  number  of  fragments  into 
which  4  is  cut  is  0(k).  The  total  number  of  fragments  is  therefore 
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0(£  k)  =  0{n2). 


k=  1 


□ 


The  quadratic  bound  on  the  expected  size  of  the  partitioning  generated  by 
3dRandomBsp  immediately  proves  that  a  BSP  tree  of  quadratic  size  exists. 


You  may  be  a  bit  disappointed  by  the  bound  that  we  have  achieved.  A  quadratic 
size  BSP  tree  is  not  what  you  are  hoping  for  when  you  have  a  set  of  10,000 
triangles.  The  following  theorem  tells  us  that  we  cannot  hope  to  prove  anything 
better  if  we  restrict  ourselves  to  auto-partitions. 

Lemma  12.4  There  are  sets  of  n  non-intersecting  triangles  in  3-space  for  which 
any  auto-partition  has  size  kl(n2). 
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Proof.  Consider  a  collection  of  rectangles  consisting  of  a  set  R\  of  rectangles 
parallel  to  the  xv-planc  and  a  set  If  of  rectangles  parallel  to  the  yz-plane,  as 
illustrated  in  the  margin.  (The  example  also  works  with  a  set  of  triangles, 
but  with  rectangles  it  is  easier  to  visualize.)  Let  n\  :=  card(f?i),  let  «2  := 
cardf/C),  and  let  G{n\  ,nf)  be  the  minimum  size  of  an  auto-partition  for  such 
a  configuration.  We  claim  that  G(n\  .nf)  =  (n i  +  1  )(«2  +  1)  —  1.  The  proof  is 
by  induction  on  ni  +n 2.  The  claim  is  obviously  true  for  G(1,0)  and  G( 0, 1),  so 
now  consider  the  case  where  n  1  +«2  >  1.  Without  loss  of  generality,  assume 
that  the  auto-partition  chooses  a  rectangle  r  from  the  set  R\ .  The  plane  through 
r  will  split  all  the  rectangles  in  If.  Moreover,  the  configurations  in  the  two 
subscenes  that  must  be  treated  recursively  have  exactly  the  same  form  as  the 
initial  configuration.  If  m  denotes  the  number  of  rectangles  of  R\  lying  above  r, 
then  we  have 

G(n\  .112)  =  1  +G(m,n2)  +  G(n  1  —  m—  1,712) 

=  1  +  ((m+  l)(/t2  +  1)  —  1)  +  ((ni  —  m)(n2+  1)  —  1) 

=  («i  +  1)(«2+  1)  —  1.  ED 


So  perhaps  we  should  not  restrict  ourselves  to  auto-partitions.  In  the  lower 
bound  in  the  proof  of  Lemma  12.4  the  restriction  to  auto-partitions  is  definitely 
a  bad  idea:  we  have  shown  that  such  a  partition  necessarily  has  quadratic  size. 


whereas  we  can  easily  get  a  linear  BSP  if  we  first  separate  the  set  R\  from  the 
set  R2  with  a  plane  parallel  to  the  xz- plane.  But  even  unrestricted  partitions  fail 
to  give  a  small  BSP  for  the  configuration  of  Figure  12.6.  This  configuration  is 
obtained  as  follows.  We  start  by  taking  a  grid  in  the  plane  made  up  of  n/2  lines 
parallel  to  the  x-axis  and  n/2  lines  parallel  to  the  y-axis.  (Instead  of  the  lines  we 
could  also  take  very  long  and  skinny  triangles.)  We  skew  these  lines  a  little  to 
get  the  configuration  of  Figure  12.6;  the  lines  now  lie  on  a  so-called  hyperbolic 
paraboloid.  Finally  we  move  the  lines  parallel  to  the  y-axis  slightly  upward  so 
that  the  lines  no  longer  intersect.  What  we  get  is  the  set  of  lines 

{y  =  i,z=  ix  :  1  ^  i  ^  n/2}  U  {x  =  i,  z  =  iy  +  £  ■  1  ^  i  fn/2}, 

where  e  is  a  small  positive  constant.  If  e  is  sufficiently  small  then  any  BSP 
must  cut  at  least  one  of  the  four  lines  that  bound  a  grid  cell  in  the  immediate 
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neighborhood  of  that  cell.  The  formal  proof  of  this  fact  is  elementary,  but  Section  12.5 

tedious  and  not  very  instructive.  The  idea  is  to  show  that  the  lines  are  skewed  in  bsp  trees  for  low-density 

such  a  way  that  no  plane  fits  simultaneously  through  the  four  “openings”  at  its  scenes 

corners.  Since  there  is  a  quadratic  number  of  grid  cells,  this  will  result  in  ®(h2) 

fragments. 

Theorem  12.5  For  any  set  of  n  non-intersecting  triangles  in  R3  a  BSP  tree  of 
size  0{n2)  exists.  Moreover,  there  are  configurations  for  which  the  size  of  any 
BSP  is  Q.(n2). 


12.5  BSP  Trees  for  Low-Density  Scenes 

In  the  previous  section,  we  described  an  algorithm  that  constructs  a  BSP  tree  for 
a  set  of  n  disjoint  triangles  in  R3.  It  always  produces  a  BSP  tree  of  size  0{n2). 
We  also  gave  an  example  of  a  set  of  n  triangles  for  which  any  BSP  tree  has 
size  fl{n2).  Hence,  the  0(n2)  upper  bound  is  tight  in  the  worst  case,  and  from  a 
theoretical  point  of  view  the  problem  seems  to  be  solved.  The  tight  quadratic 
bound  might  give  you  the  idea  that  BSP  trees  are  useless  in  practice.  Fortunately 
this  is  not  the  case:  in  many  practical  situations,  BSP  trees  perform  just  fine. 
Evidently,  the  theoretical  analysis  fails  to  predict  the  practical  performance  of 
BSP  trees. 

This  is  disturbing:  based  on  our  theoretical  analysis  we  might  have  discarded 
a  structure  that  is  quite  useful  in  practice.  The  problem  is  that  certain  inputs — 
the  grid-like  lower-bound  construction,  for  example — force  a  BSP  to  cut  may 
objects,  while  other  inputs — the  ones  that  usually  occur  in  practice — admit 
a  BSP  that  cuts  only  few  objects.  We  would  like  our  analysis  to  reflect  this: 
it  should  give  different  bounds  for  different  types  of  input.  This  means  that 
we  can  no  longer  do  the  analysis  in  terms  of  the  input  size,  n,  only.  We  must 
introduce  another  parameter,  which  distinguishes  easy  inputs  from  difficult  ones. 
What  are  easy  inputs?  Intuitively,  easy  inputs  are  inputs  where  the  objects  are 
relatively  well  separated,  whereas  difficult  inputs  have  many  objects  packed 
closely  together.  Note  that  whether  or  not  objects  are  close  to  each  other  is  not 
a  matter  of  the  absolute  distance  between  them,  but  of  the  distance  relative  to 
their  size;  otherwise,  scaling  the  whole  scene  would  lead  to  different  results, 
which  is  undesirable.  Therefore  we  define  our  parameter,  which  we  will  call  the 
density  of  a  scene,  as  follows. 

Let  diam(o)  denote  the  diameter  of  an  object  o.  The  density  of  a  set  S  of 
objects  in  Rf/  is  defined  as  the  smallest  number  A  such  that  the  following  holds: 
any  ball  B  intersects  at  most  A  objects  o  £  S  such  that  diam(o)  ^  diam(Tf). 
Figure  12.7  illustrates  this  definition.  Note  that  the  definition  speaks  of  any 
ball:  B  is  not  an  object  in  S,  but  a  ball  of  arbitrary  radius  whose  center  can  be 
anywhere  in  space. 

It  is  easy  to  come  up  with  a  set  of  n  objects  whose  density  is  n:  any  set  of 
lines  will  do.  When  the  objects  are  bounded,  the  density  can  still  be  high.  The 
grid-like  construction  of  Figure  12.6,  for  example,  has  density  ©(h)  even  if  the 
objects  in  the  construction  are  line  segments  rather  than  full  lines.  On  the  other 
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A  set  of  eight  segments  with  density  3 
The  disc  B  intersects  five  segments,  but 
two  of  them  have  diameter  less  than 
diam(S)  and  so  they  are  not  counted 


hand,  the  density  can  also  be  quite  low:  a  set  of  n  unit  balls  such  that  any  two 
balls  are  more  than  unit  distance  apart  has  density  1 .  In  fact,  one  can  prove  that 
any  set  of  n  disjoint  balls  has  density  0(1),  even  if  they  have  vastly  different 
sizes — see  Exercise  12.13. 

Let’s  see  where  we  stand.  We  have  defined  a  parameter,  density,  that  captures 
the  difficulty  of  a  scene  in  the  following  sense:  if  the  density  is  low  then  the 
objects  are  reasonably  well  separated,  and  if  the  density  is  high  then  there  are 
regions  with  many  objects  close  together.  Next  we  want  to  show  that  if  the 
density  is  low — a  constant  independent  of  n,  for  instance — then  we  can  find  a 
small  BSP.  One  possibility  could  be  to  analyze  the  randomized  algorithm  of  the 
previous  section  more  carefully,  and  show  that  it  produces  a  small  BSP  if  the 
density  of  the  input  set  is  low.  Unfortunately,  this  is  not  the  case:  even  for  inputs 
of  low  density,  it  may  produce  a  BSP  whose  expected  size  is  quadratic.  In  other 
words,  the  algorithm  fails  to  always  take  advantage  of  the  situation  when  the 
input  scene  is  easy.  We  need  a  new  algorithm. 

Let  S  be  a  set  of  objects  in  R2 — S  can  contain  segments,  discs,  triangles,  etc. — 
and  let  A  be  the  density  of  S.  (The  algorithm  presented  below  also  works  in 
R3  and.  in  fact,  even  in  higher  dimensions.  For  simplicity  we  shall  confine 
ourselves  to  R2  from  now  on.  )  The  idea  behind  the  algorithm  is  to  define,  for 
each  object  o  £  S,  a  small  set  of  points — we  call  them  guards — such  that  the 
distribution  of  the  guards  is  representative  of  the  distribution  of  the  objects,  and 
then  to  let  the  construction  of  the  BSP  be  guided  by  the  guards.  We  shall  now 
make  this  idea  precise. 

Let  bb(o)  denote  the  bounding  box  of  o,  that  is,  bb(o)  is  the  smallest  axis- 
aligned  rectangle  that  contains  o.  The  guards  that  we  define  for  o  are  simply 
the  four  vertices  of  bb(o).  Let  G(S')  be  the  multiset  of  4n  guards  defined  for  the 
objects  in  S.  (G(S)  is  a  multiset  because  bounding-box  vertices  can  coincide. 
When  this  happens,  we  want  those  guards  to  be  put  multiple  times  into  G(S), 
once  for  each  object  of  which  they  are  a  bounding-box  vertex.)  When  S  has 
low  density,  the  guards  in  G(S)  are  representative  of  the  distribution  of  S  in 
the  following  sense:  for  any  square  a,  the  number  of  objects  intersecting  o  is 
not  much  more  than  the  number  of  guards  inside  a.  The  next  lemma  makes 
this  precise.  Note  that  this  lemma  gives  only  an  tipper  bound  on  the  number  of 
objects  intersecting  a  square,  not  a  lower  bound:  it  is  very  well  possible  that 
a  square  contains  many  guards  without  intersecting  a  single  object.  Note  also 
that  even  though  the  definition  of  density  (in  2D)  uses  discs,  the  property  of  the 
guards  given  in  the  following  lemma  is  with  respect  to  squares. 


Lemma  12.6  Any  axis-parallel  square  that  contains  k  guards  from  G(S)  in  its  Section  12.5 

interior  intersects  at  most  k  +  4A  objects  from  S.  bsp  trees  for  low-density 

SCENES 

Proof.  Let  o  be  an  axis-parallel  square  with  k  guards  in  its  interior.  Obviously 
there  are  at  most  k  objects  that  have  a  guard  (that  is,  a  bounding-box  vertex) 
inside  a.  Define  S'  to  be  the  set  of  the  remaining  objects  from  S,  that  is,  the 
ones  without  a  guard  inside  a.  Clearly,  the  density  of  S'  is  at  most  A.  We  have 
to  show  that  at  most  4A  objects  from  S'  can  intersect  a. 


y 


Figure  12.8 

The  square  a  intersects  a  segment  but 
does  not  contain  a  guard.  Hence,  the 
diameter  of  the  segment  is  at  least  the 
x  edge  length  of  a 


If  an  object  o  £  S'  intersects  cr  then,  obviously,  bb(o)  intersects  a  as  well. 
By  the  definition  of  S',  the  square  a  does  not  contain  a  vertex  of  bb(o)  in  its 
interior.  But  then  the  projection  of  bb(o)  onto  the  x-axis  contains  the  projection 
of  cr  onto  the  x-axis,  or  the  projection  of  bb(o)  onto  the  j'-axis  contains  the 
projection  of  a  onto  the  y-axis  (or  both) — see  Figure  12.8.  This  implies  that  the 
diameter  of  o  is  at  least  the  side  length  of  cr,  and  so  diam(o)  ^  diam(cr) / \/2. 
Now  cover  cr  with  four  discs  D\, . . .  ,£>4  of  diameter  diam(cr) /2.  The  object  o 
intersects  at  least  one  of  these  discs,  D,.  We  charge  o  to  D,.  We  have 

diam(o)  ^  diam(cr)/v/2  >  diam(cr)/2  =  diam(Z),). 

Since  S'  has  density  at  most  A,  each  Z),  is  charged  at  most  A  times.  Hence,  cr 
intersects  at  most  4A  objects  from  S' .  Adding  the  at  most  k  objects  not  in  S',  we 
find  that  cr  intersects  at  most  k  +  4A  objects  from  S.  0 

Lemma  12.6  suggests  the  following  two-phase  algorithm  to  construct  a  BSP. 
Let  U  be  a  square  that  contains  all  objects  from  S  in  its  interior. 

In  the  first  phase,  we  recursively  subdivide  U  into  squares  until  each  square 
contains  at  most  one  guard  in  its  interior.  In  other  words,  we  construct  a  quadtree 
on  G(s) — see  Chapter  14.  Partitioning  a  square  into  its  four  quadrants  can  be 
done  by  first  splitting  the  square  into  two  equal  halves  with  a  vertical  line  and 
then  splitting  each  half  with  a  horizontal  line.  Hence,  the  quadtree  subdivision 
gives  us  a  BSP  tree  on  the  set  G(S).  Figure  12.9  illustrates  this.  Notice  that 
some  splitting  lines,  £ 2  and  ^3  for  instance,  are  in  fact  the  same  line;  what  differs 
is  which  portion  of  the  line  is  relevant,  but  this  information  is  not  stored  with 
the  nodes  in  the  BSP  tree.  By  Lemma  12.6,  each  leaf  region  in  the  resulting 
subdivision  intersects  only  a  few  objects — at  most  1  +4A,  to  be  precise.  The 
second  phase  of  the  algorithm  then  partitions  each  leaf  region  further,  until  all 
objects  are  separated.  How  the  second  phase  is  done  exactly  depends  on  the 
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BINARY  SPACE  PARTITIONS 


Figure  12.9 
A  quadtree  subdivision  and  the 
corresponding  BSP  tree. 
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type  of  objects  in  S.  If  the  objects  are  line  segments,  for  example,  we  can  apply 
Algorithm  2dRandomBSP  given  in  Section  12.3  to  the  segment  fragments  in 
each  leaf  region. 


The  crucial  property  of  the  algorithm  we  just  sketched  is  that  for  low-density 
scenes  the  first  phase  produces  leaf  regions  that  intersect  few  objects.  Unfortu¬ 
nately,  there  is  one  problem:  the  number  of  leaf  regions  can  be  very  large.  This 
happens  for  example  when  two  guards  lie  very  close  together  near  a  corner  of 
the  initial  square  U.  Therefore  we  modify  the  first  phase  of  the  algorithm  to 
guarantee  it  produces  a  linear  number  of  leaf  regions.  This  is  done  as  follows. 

The  first  modification  is  that  we  do  not  continue  subdividing  until  each 
region  has  only  one  guard  in  it,  but  instead  we  stop  when  the  region  contains  k 
or  fewer  guards,  for  some  suitable  parameter  k  ^  1.  The  reason  for  this  and  the 
choice  of  k  will  be  discussed  later. 

The  second  modification  is  the  following.  Suppose  that  in  the  recursive 
subdivision  procedure  we  have  to  subdivide  a  square  a.  Consider  the  four 
quadrants  of  a.  If  at  least  two  of  them  contain  more  than  k  guards  in  their 
interior,  then  we  proceed  as  before  by  applying  a  quadtree  split,  we  partition 
a  into  its  four  quadrants  by  first  splitting  it  with  a  vertical  line  tv(o)  and 
then  splitting  each  half  with  a  horizontal  line  //,(T) — see  Figure  12.10.  After 
applying  the  quadtree  split,  we  recurse  on  the  quadrants.  If  none  of  the  quadrants 


Figure  12.10 
Examples  of  a  quadtree  split  and  a 
shrinking  step  with  k  =  4 


quadtree  split  shrinking  step 


•  • 

• 

• 

• 

• 

4  (0) 

i 

tv(G) 

w 

• 

• 

• 

• 

contains  more  than  k  guards,  we  also  perform  a  quadtree  split;  in  this  case  the 
four  quadrants  all  become  leaf  regions.  If  there  is  exactly  one  quadrant,  say  g'. 
with  more  than  k  guards  in  it  we  have  to  be  careful:  all  guards  could  be  very 
close  to  a  corner,  and  then  it  may  take  many  quadtree  splits  before  we  finally 
separate  them — see  also  Lemma  14.1.  Therefore  we  perform  a  shrinking  step. 
Intuitively,  we  shrink  G1  until  at  least  k  guards  are  not  in  the  interior  of  G1 .  More 
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precisely,  a  shrinking  step  proceeds  as  follows.  Assume  that  o'  is  the  north-west  Section  12.5 

quadrant  of  <7;  the  other  three  cases  are  handled  in  a  symmetrical  fashion.  We  bsp  trees  for  low-density 

shrink  o'  by  moving  its  bottom-right  corner  diagonally  to  the  north-west — o'  scenes 

thus  remains  a  square  during  the  shrinking  process — until  at  least  k  guards  are 

outside  the  interior  of  o' .  With  a  slight  abuse  of  notation,  we  use  o'  to  denote 

the  shrunk  quadrant.  Notice  that  o'  has  at  least  one  guard  on  its  boundary.  We 

partition  a  by  first  splitting  it  with  a  vertical  line  £v(o)  through  the  right  edge  of 

o' ,  and  then  splitting  the  two  resulting  parts  with  a  horizontal  line  £h(o)  through 

the  bottom  edge  of  o' — see  Figure  12.10.  This  partitions  o  into  four  regions, 

two  of  which  are  squares.  In  particular  o',  the  only  region  that  can  contain  more 

than  k  guards  and  therefore  may  have  to  be  split  further,  is  a  square. 

Algorithm  PHASE  1  summarizes  the  recursive  splitting  procedure. 

Algorithm  Phase l(o,G,k) 

Input.  A  region  o,  a  set  G  of  guards  in  the  interior  of  o,  and  an  integer  k  ^  1 . 

Output.  A  BSP  tree  T  such  that  each  leaf  region  contains  at  most  k  guards. 

1.  ifcard(G)^k 

2.  then  Create  a  BSP  tree  7  consisting  of  a  single  leaf  node. 

3.  else  if  exactly  one  quadrant  of  o  contains  more  than  k  guards  in  its 

interior 

4.  then  Determine  the  splitting  lines  £v(o)  and  £},(o)  for  a  shrink¬ 

ing  step,  as  explained  above. 

5.  else  Determine  the  splitting  lines  £v(o)  and  4(<?)  f°r  a  quadtree 

split,  as  explained  above. 

6.  Create  a  BSP  tree  T  with  three  internal  nodes;  the  root  of  T  stores 
£v(o)  as  its  splitting  line,  and  both  children  of  the  root  store  £h(o) 
as  their  splitting  line. 

7.  Replace  each  leaf  /i  of  7  by  a  BSP  tree  7fl  computed  recursively 
on  the  region  corresponding  to  /i  and  the  guards  inside  that  region. 

8.  return  T 


Lemma  12.7  PhaslI  (U  .G(S).k)  produces  a  BSP  tree  with  0(n/k )  leaves, 
where  each  leaf  region  intersects  at  mostk  +  4A  objects. 


Proof.  We  first  prove  the  bound  on  the  number  of  leaves.  This  number  is  one 
more  than  the  number  of  internal  nodes,  and  so  it  suffices  to  bound  the  latter 
number. 

Let  N(m)  denote  the  maximum  number  of  internal  nodes  in  a  BSP  tree  cre¬ 
ated  by  Phase1(<7,  G,  k )  when  card(G)  =  m.  If  m  f  k,  no  splits  are  performed, 
and  so  N(m)  =  0  in  this  case.  Otherwise,  a  quadtree  split  or  a  shrinking  step 
is  applied  to  o.  This  results  in  three  internal  nodes,  and  four  regions  in  which 
we  recurse.  Let  mi, . . .  ,m 4  denote  the  numbers  of  guards  in  the  four  regions, 
and  let  /:={/:  1  f  i  f  4  and  m,  >  k\ .  Since  a  region  that  contains  k  or  fewer 
guards  is  a  leaf  region,  we  know  that  N (m,)  =  0  for  i  f  I.  Hence, 


f  0  if  m  f  k 

\  3  +  L/e/ A(m,)  otherwise 


We  will  prove  by  induction  that  N(m)  ^  max(0,  (6m /k)  —  3).  This  is  obviously 
true  for  m  f  k,  and  so  we  now  assume  that  m  >  k.  A  guard  can  be  in  the  interior 
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Chapter  12  of  at  most  one  region,  which  means  that  ^  m.  If  at  least  two  quadrants 

binary  space  partitions  of  ct  contain  more  than  k  guards,  then  card(7)  ^  2  and  we  have 

N(m)  <  3  +  ^/V(m,)  <  3  +  (  ^(6m,/&)  )  —  card(7)  •  3  <  6m/k  —  3, 
iei  \iei  J 

as  claimed.  If  none  of  the  quadrants  contains  more  than  k  guards,  then  the 
four  regions  are  all  leaf  regions  and  N(m)  =  3.  Together  with  the  assumption 
m  >  k,  this  implies  N (in)  (6m /k)  —  3.  The  remaining  case  is  where  exactly 

one  quadrant  contains  more  than  k  guards.  In  this  case  we  do  a  shrinking  step. 

Because  of  the  way  a  shrinking  step  is  performed,  a  shrunk  quadrant  contains 
fewer  than  m  —  k  guards  and  the  other  resulting  regions  contain  at  most  k  guards. 
Hence,  in  this  case  we  have 

N(m)  ^3  +  N(m  —  k)  ^  3  +  (6(m  —  k)/k  —  3)  ^  6m/k  —  3. 

So  in  all  cases  N(m)  ^  (6m /k)  —  3,  as  claimed.  This  proves  the  bound  on  the 
number  of  internal  nodes. 

It  remains  to  prove  that  each  leaf  region  intersects  at  most  k  +  AX  objects.  By 
construction,  a  leaf  region  contains  at  most  k  guards  in  its  interior.  Hence,  if 
a  leaf  region  is  a  square.  Lemma  12.6  implies  that  it  intersects  k  +  AX  objects. 
There  can  also  be  leaf  regions,  however,  that  are  not  square,  and  then  we 
cannot  apply  Lemma  12.6  directly.  A  non-square  leaf  region  a"  must  have 
been  produced  in  a  shrinking  step,  as  shown  in  Figure  12.10.  Recall  that  the 
shrinking  process  stops  as  soon  as  k  or  more  guards  are  not  in  the  interior  of  the 
shrunk  quadrant  o' .  At  least  one  of  these  guards  must  be  on  the  boundary  of 
<x',  and  in  fact  the  number  of  guards  exterior  to  a'  (not  counting  the  guards  on 
the  boundary  of  a')  must  be  less  than  k.  This  implies  that  a"  can  be  covered 
by  a  square  with  at  most  k  guards  in  its  interior;  this  square  is  shown  in  gray  in 
the  figure  in  the  margin.  By  Lemma  12.6,  this  square  intersects  at  most  k  +  AX 
objects,  and  so  a"  intersects  at  most  that  many  objects  as  well.  0 

Lemma  12.7  explains  why  it  can  be  advantageous  to  use  a  value  for  k  that  is 
larger  than  1 :  the  larger  k  is,  the  fewer  leaf  regions  we  will  have.  On  the  other 
hand,  a  larger  k  will  also  mean  more  objects  per  leaf  region.  A  good  choice  of 
k  is  therefore  one  that  reduces  the  number  of  leaf  regions  as  much  as  possible, 
without  significantly  increasing  the  number  of  objects  per  leaf  region.  Setting 
k  :=  A  will  do  this:  the  number  of  leaf  regions  will  decrease  by  a  factor  A  (as 
compared  with  k  =  1),  while  the  maximum  number  of  objects  per  leaf  region 
does  not  increase  asymptotically — it  only  goes  from  1  +4A  to  5A. 

There  is  one  problem,  however:  we  do  not  know  A,  the  density  of  the  input 
scene,  and  so  we  cannot  use  it  as  a  parameter  in  the  algorithm.  Therefore  we 
use  the  following  trick.  We  guess  a  small  value  for  A,  say  A  =  2.  Then  we 
run  PHASE  1  with  our  guess  as  the  value  of  k,  and  we  check  whether  each  leaf 
region  in  the  resulting  BSP  tree  intersects  at  most  5k  objects.  If  so,  we  proceed 
with  the  second  phase  of  the  algorithm;  otherwise,  we  double  our  guess  and  try 
again.  This  leads  to  the  following  algorithm. 
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Section  12.5 

BSP  TREES  FOR  LOW-DENSITY 
SCENES 


When  the  input  S  consists  of  non-intersecting  segments  in  the  plane,  we  can  use 
2dRandomBsp  to  compute  the  BSPs  in  line  9,  leading  to  the  following  result. 

Theorem  12.8  For  any  set  S  of  n  disjoint  line  segments  in  the  plane,  there  is  a 
BSP  of  size  0(n  log  A),  where  A  is  the  density  ofS. 

Proof.  By  Lemma  12.7,  PHASE  l(t/,G(S),  k)  results  in  a  BSP  tree  where  each 
leaf  region  intersects  at  most  A  +  4A  objects.  Hence,  the  test  in  line  7  of 
LowDensityBSP2d  is  guaranteed  to  be  false  if  k  ^  A.  (If  k  <  A,  the  test 
may  or  may  not  be  false.)  The  while-loop  therefore  ends,  at  the  latest,  when  k 
becomes  larger  than  A  for  the  first  time.  Since  k  doubles  every  time,  this  implies 
k  ^  2 A  when  we  get  to  the  second  phase  in  line  8. 

Let  k*  denote  the  value  of  k  when  we  get  to  line  8.  We  have  just  argued  that 
k*  ^  2A .  The  test  in  line  7  guarantees  that  each  leaf  region  intersects  at  most  5k* 
segments.  Hence,  according  to  Lemma  12.1,  each  tree  ‘.T[(  has  (expected)  size 
0(k*  logk*)  when  2dRandomBsp  is  used  in  line  9.  Because  there  are  0(n/k*) 
leaf  regions,  the  total  size  of  the  BSP  tree  is  0(n  log/C).  Since  k*  f  2A,  this 
proves  the  theorem.  0 

The  bound  in  Theorem  12.8  is  never  worse  than  O(nlogn).  In  other  words, 
the  algorithm  described  above  is  as  good  as  the  algorithm  given  in  Section  12.3 
in  the  worst  case,  but  it  provably  profits  when  the  density  of  the  input  is  low. 

Recall  that  the  reason  for  introducing  the  concept  of  density  was  the  quadratic 
worst-case  bound  for  BSPs  for  triangles  in  R3.  The  algorithm  that  we  have  just 
described  works  very  well  for  segments  in  the  plane:  it  produces  a  BSP  whose 
size  is  O(nlogn)  in  the  worst  case  and  0(n)  when  the  density  of  the  input  is 
a  constant.  What  happens  when  we  apply  this  approach  to  a  set  of  triangles 
in  R3?  As  it  turns  out,  it  also  leads  to  good  results  in  this  case,  as  stated  in  the 
next  theorem.  (Exercise  12.18  asks  you  to  prove  this  theorem.) 


Algorithm  LowDensityBSP2d(S) 

Input.  A  set  S  of  n  objects  in  the  plane. 

Output.  A  BSP  tree  T  for  S. 

1.  Let  G(S)  be  the  set  of  An  bounding-box  vertices  of  the  objects  in  5. 

2.  k  <—  1;  done  <—  false;  U  <—  a  bounding  square  of  S 

3.  while  not  done 

4.  dok<—  2k',  T  <—  Phase l(f/,G(S),£);  done  <—  true 

5.  for  each  leaf  p  of  T 

6.  do  Compute  the  set  S(p)  of  object  fragments  in  the  region  of  p. 

7.  if  card(.S’('/i))  >  5k  then  done  <—  false 

8.  for  each  leaf  p  of  7 

9.  do  Compute  a  BSP  tree  for  .Sip )  and  replace  p  by  Tu . 

10.  return  T 


Theorem  12.9  For  any  set  S  ofn  disjoint  triangles  in  R3,  there  is  a  BSP  of  size 
0(nX),  where  A  is  the  density  of  S . 
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Chapter  12  The  bound  in  Theorem  12.9  interpolates  nicely  between  0{n)  and  0{n2) 
binary  space  partitions  as  A  varies  from  1  to  /;.  Therefore  the  algorithm  produces  a  BSP  whose  size 

is  optimal  in  the  worst  case.  But  the  result  is  even  stronger:  the  Of/; A)  bound 
is  optimal  for  all  values  of  A :  for  any  zz  and  any  A  with  1  ^  A  ^  /;  there  is  a 
collection  of  zz  triangles  in  R3  whose  density  is  A  and  for  which  any  BSP  must 
have  size  f2(z;A). 


12.6  Notes  and  Comments 

BSP  trees  are  popular  in  many  application  areas,  in  particular  in  computer 
graphics.  The  application  mentioned  in  this  chapter  is  to  performing  hidden- 
surface  removal  with  the  painter’s  algorithm  [185],  Other  applications  include 
shadow  generation  [124],  set  operations  on  polyhedra  [292,  370],  and  visibility 
preprocessing  for  interactive  walkthroughs  [369].  BSP  trees  have  also  been  used 
in  cell  decomposition  methods  in  motion  planning  [36],  for  range  searching  [60], 
and  as  general  indexing  structure  in  GIS  [294].  Two  other  well-known  structures, 
kd-trees  and  quadtrees,  are  in  fact  special  cases  of  BSP  trees,  where  only 
orthogonal  splitting  planes  are  used.  Kd-trees  were  discussed  extensively  in 
Chapter  5  and  quadtrees  will  be  discussed  in  Chapter  14. 

The  study  of  BSP  trees  from  a  theoretical  point  of  view  was  initiated  by  Pa¬ 
terson  and  Yao  [317];  the  results  in  Sections  12.3  and  12.4  come  from  their 
paper.  They  also  proved  bounds  on  BSPs  in  higher  dimensions:  any  set  of 
(d  —  1) -dimensional  simplices  in  Rrf,  with  d  ^  3,  admits  a  BSP  of  size  0{t 
Paterson  and  Yao  also  obtained  results  for  orthogonal  objects  in  higher  dimen¬ 
sions  [318].  For  instance,  they  proved  that  any  set  of  orthogonal  rectangles  in 
R3  admits  a  BSP  of  size  0{nsfn),  and  that  this  bound  is  tight  in  the  worst  case. 
Below  we  discuss  several  of  the  results  that  have  been  obtained  since.  A  more 
extensive  overview  has  been  given  by  Toth  [373]. 

For  a  long  time  it  was  unknown  whether  any  set  of  zz  disjoint  line  segments 
in  the  plane  would  admit  a  BSP  of  size  0(n),  but  Toth  [372]  proved  that  this 
is  not  the  case,  by  constructing  a  set  of  segments  for  which  any  BSP  must 
have  size  £2(zz  log/;/ log  log/;).  Note  that  there  is  still  a  small  gap  between  this 
lower  bound  and  the  currently  known  upper  bound,  which  is  0(/;  log /;) .  There 
are  several  special  cases,  however,  where  an  0(n)  size  BSP  is  possible.  For 
example,  Paterson  and  Yao  [317]  have  shown  that  any  set  of  zz  disjoint  segments 
in  the  plane  that  are  all  either  horizontal  or  vertical  admit  a  BSP  of  size  0(n). 
The  same  result  was  achieved  by  d’Amore  and  Franciosa  [138].  Toth  [371] 
generalized  this  result  to  segments  with  a  limited  number  of  orientations.  Other 
special  cases  where  a  linear-size  BSP  is  always  possible  are  for  line  segments 
with  more  or  less  the  same  length  [54]  and.  as  we  have  already  seen  in  this 
chapter,  for  sets  of  objects  of  constant  density. 
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In  Section  12.5,  we  studied  BSPs  for  low-density  scenes.  This  was  inspired  by 
the  observation  that  the  worst-case  size  of  BSPs  for  3-dimensional  scenes  has 
little  to  do  with  their  practical  performance.  A  similar  situation  arises  frequently 


in  the  study  of  geometric  algorithms:  often  one  can  come  up  with  input  sets 
for  which  the  algorithm  at  hand  is  not  very  efficient,  but  in  many  cases  such 
input  sets  are  not  very  realistic.  This  can  have  two  disadvantages.  First,  a  worst- 
case  analysis  of  the  algorithm  may  not  be  very  informative  as  to  whether  the 
algorithm  is  useful  in  practice.  Second,  since  algorithms  are  typically  designed 
to  have  the  best  worst-case  performance,  they  may  be  geared  towards  handling 
situations  that  will  not  arise  in  practice  and  therefore  they  may  be  needlessly 
complicated.  The  underlying  reason  for  this  is  that,  usually,  the  running  time 
of  a  geometric  algorithm  not  only  is  determined  by  the  size  of  the  input  but 
also  is  strongly  influenced  by  the  shape  of  the  input  objects  and  their  spatial 
distribution.  To  overcome  this  problem,  one  can  try  to  define  a  parameter  that 
captures  the  geometry  of  the  input — just  as  we  did  in  Section  12.5. 

The  parameter  that  has  been  used  most  often  in  this  context  is  fatness.  A 
triangle  is  called  /3- fat  if  all  its  angles  are  at  least  /3.  It  has  been  shown  that 
the  complexity  of  the  union  of  n  intersecting  /3-fat  triangles  in  the  plane  is 
near-linear  in  n  if  /3  is  a  constant  [268];  currently,  the  best  known  bound  is 
<9((l//3)log(l//3)  ■  n  loglogn)  [314].  The  concept  of  fatness  has  been  gener¬ 
alized  to  arbitrary  convex  objects,  and  even  to  non-convex  objects.  One  of 
the  most  general  definitions  was  given  by  van  der  Stappen  [362],  who  defined 
an  object  o  in  LR'/  to  be  /3-fat  if  the  following  holds:  for  any  ball  B  whose 
center  lies  in  o  and  that  does  not  fully  contain  o  in  its  interior,  we  have  that 
vol(o  n  B)  ^  /3  vol(B),  where  vol(-)  denotes  the  volume.  There  are  many 
problems  that  can  be  solved  more  efficiently  for  fat  objects  than  for  general 
objects.  Examples  are  range  searching  and  point  location  [51,  60],  motion 
planning  [363],  hidden-surface  removal  [229],  ray  shooting  [21,  49,  53,  228], 
and  computing  depth  orders  [53,  228]. 

The  parameter  used  in  Section  12.5,  density,  has  also  been  studied  a  lot.  It 
can  be  shown  that  any  set  of  disjoint  /3-fat  objects  has  density  0(1 /p)  [55,  362], 
and  so  any  result  obtained  for  low-density  scenes  immediately  gives  a  result 
for  disjoint  fat  objects.  The  algorithm  for  constructing  a  BSP  for  low-density 
scenes  described  in  Section  12.5  is  a  modified  and  slightly  improved  version  of 
the  construction  by  de  Berg  [51].  Some  of  the  results  mentioned  above  for  fat 
objects  are  in  fact  based  on  this  construction  [60,  363]  and  hence  also  apply  to 
low-density  scenes. 


12.7  Exercises 

12.1  Prove  that  PaintersAlgorithm  is  correct.  That  is,  prove  that  if  (some 
part  of)  an  object  A  is  scan-converted  before  (some  part  of)  object  B  is 
scan-converted,  then  A  cannot  lie  in  front  of  B. 

12.2  Let  S  be  a  set  of  m  polygons  in  the  plane  with  n  vertices  in  total.  Let 
T  be  a  BSP  tree  for  S  of  size  k.  Prove  that  the  total  complexity  of  the 
fragments  generated  by  the  BSP  is  0(n  +  k). 
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Chapter  12  12.3  Give  an  example  of  a  set  of  line  segments  in  the  plane  where  the  greedy 

binary  space  partitions  method  of  constructing  an  auto-partition  (where  the  splitting  line  £(s)  is 

taken  that  induces  the  least  number  of  cuts)  results  in  a  BSP  of  quadratic 
size. 

12.4  Give  an  example  of  a  set  S  of  n  non-intersecting  line  segments  in  the 
plane  for  which  a  BSP  tree  of  size  n  exists,  whereas  any  auto-partition 
of  5  has  size  at  least  |_4n /3J . 

12.5  Give  an  example  of  a  set  S  of  n  disjoint  line  segments  in  the  plane  such 
that  any  auto-partition  for  S  has  depth  Q.(n). 

12.6  We  have  shown  that  the  expected  size  of  the  partitioning  produced  by 
2dRandomBSP  is  O(nlogn).  What  is  the  worst-case  size? 

12.7  Suppose  we  apply  2dRandomBSP  to  a  set  of  intersecting  line  segments 
in  the  plane.  Can  you  say  anything  about  the  expected  size  of  the 
resulting  BSP  tree? 

12.8  In  3dRandomBSP2,  it  is  not  described  how  to  find  the  cells  that 
must  be  split  when  a  splitting  plane  is  added,  nor  is  it  described  how 
to  perform  the  split  efficiently.  Work  out  the  details  for  this  step,  and 
analyze  the  running  time  of  your  algorithm. 

12.9  Give  a  deterministic  divide-and-conquer  algorithm  that  constructs  a  BSP 
tree  of  size  O(ttlogn)  for  a  set  of  n  line  segments  in  the  plane.  Hint:  Use 
as  many  free  splits  as  possible  and  use  vertical  splitting  lines  otherwise. 

12.10  Let  C  be  a  set  of  n  disjoint  unit  discs — discs  of  radius  1 — in  the  plane. 
Show  that  there  is  a  BSP  of  size  0(n)  for  C.  Hint:  Start  by  using  a 
suitable  collection  of  vertical  lines  of  the  form  x  =  2 i  for  some  integer  i. 

12.1 1  BSP  trees  can  be  used  for  a  variety  of  tasks.  Suppose  we  have  a  BSP  on 
the  edges  of  a  planar  subdivision. 

a.  Give  an  algorithm  that  uses  the  BSP  tree  to  perform  point  location  on 
the  subdivision.  What  is  the  worst-case  query  time? 

b.  Give  an  algorithm  that  uses  the  BSP  tree  to  report  all  the  faces  of  the 
subdivision  intersected  by  a  query  segment.  What  is  the  worst-case 
query  time? 

c.  Give  an  algorithm  that  uses  the  BSP  tree  to  report  all  the  faces  of  the 
subdivision  intersected  by  an  axis-parallel  query  rectangle.  What  is 
the  worst-case  query  time? 

12.12  In  Chapter  5  kd-trees  were  introduced.  Kd-trees  can  also  store  segments 
instead  of  points,  in  which  case  they  are  in  fact  a  special  type  of  BSP 
tree,  where  the  splitting  lines  for  nodes  at  even  depth  in  the  tree  are 
horizontal  and  the  splitting  lines  at  odd  levels  are  vertical. 

a.  Discuss  the  advantages  and/or  disadvantages  of  BSP  trees  over  kd- 
trees. 
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b.  For  any  set  of  two  non-intersecting  line  segments  in  the  plane  there 
exists  a  BSP  tree  of  size  2.  Prove  that  there  is  no  constant  c  such  that 
for  any  set  of  two  non-intersecting  line  segments  there  exists  a  kd-tree 
of  size  at  most  c. 

12.13  Prove  that  the  density  of  any  set  of  disjoint  discs  in  the  plane  is  at  most  9. 
(Thus  the  dependency  is  independent  of  the  number  of  discs.)  Use  this 
to  show  that  any  set  of  n  disjoint  discs  in  the  plane  has  a  BSP  of  size 
0(n).  Generalize  the  result  to  higher  dimensions. 

12.14  A  triangle  is  called  a-fat  if  all  its  angles  are  at  least  a.  Prove  that  any 
set  of  a-fat  disjoint  triangles  in  the  plane  has  density  0(\/a).  Use 
this  to  show  that  any  set  of  n  disjoint  a-fat  triangles  admits  a  BSP  of 
size  0(«log(l /a)),  and  argue  that  this  implies  that  any  set  of  n  disjoint 
squares  admits  a  BSP  of  size  O(n). 

12.15  Give  an  example  of  a  set  of  n  triangles  whose  density  is  a  constant 
such  that  any  auto-partition — that  is,  any  BSP  that  only  uses  splitting 
planes  containing  input  triangles — has  size  G(n2).  (This  example  shows 
that  the  randomized  algorithm  of  Section  12.4  can  produce  a  BSP  of 
expected  size  £2(n2)  even  when  the  density  of  the  input  set  of  triangles  is 
a  constant.)  Hint: Use  a  set  of  triangles  that  are  all  parallel  to  the  z-axis 
and  whose  projections  onto  the  xy- plane  form  a  grid. 

12.16  Let  T  be  a  set  of  disjoint  triangles  in  the  plane.  Instead  of  taking  the 
bounding-box  vertices  of  the  triangles  as  guards,  we  could  also  take  the 
vertices  of  the  triangles  themselves.  Show  that  this  is  not  a  good  set 
of  guards  by  giving  a  counterexample  to  Lemma  12.6  if  the  guards  are 
defined  in  this  way.  Can  you  give  a  counterexample  for  the  case  where 
all  triangles  are  equilateral  triangles? 

12.17  Show  that  Algorithm  LowDensityBSP2d  can  be  implemented  so  that 
it  runs  in  0{n2)  time. 

12.18  Generalize  Algorithm  LowDensityBSP2d  to  R3,  and  analyze  the  size 
of  the  resulting  BSP. 
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13  Robot  Motion  Planning 

Getting  Where  You  Want  to  Be 


One  of  the  ultimate  goals  in  robotics  is  to  design  autonomous  robots:  robots 
that  you  can  tell  what  to  do  without  having  to  say  how  to  do  it.  Among  other 
things,  this  means  a  robot  has  to  be  able  to  plan  its  own  motion. 

To  be  able  to  plan  a  motion,  a  robot  must  have  some  knowledge  about  the 
environment  in  which  it  is  moving.  For  example,  a  mobile  robot  moving  around 
in  a  factory  must  know  where  obstacles  are  located.  Some  of  this  information — 
where  walls  and  machines  are  located — can  be  provided  by  a  floor  plan.  For 
other  information  the  robot  will  have  to  rely  on  its  sensors.  It  should  be  able  to 
detect  obstacles  that  are  not  on  the  floor  plan — people,  for  instance.  Using  the 
information  about  the  environment,  the  robot  has  to  move  to  its  goal  position 
without  colliding  with  any  of  the  obstacles. 

This  motion  planning  problem  has  to  be  solved  whenever  any  kind  of  robot 
wants  to  move  in  physical  space.  The  description  above  assumed  that  we  have 
an  autonomous  robot  moving  around  in  some  factory  environment.  That  kind  of 
robot  is  still  quite  rare  compared  to  the  robot  arms  that  are  now  widely  employed 
in  industry. 

A  robot  arm,  or  articulated  robot,  consists  of  a  number  of  links,  connected 
by  joints.  Normally,  one  end  of  the  arm — its  base — is  firmly  connected  to  the 
ground,  while  the  other  end  carries  a  hand  or  some  kind  of  tool.  The  number 
of  links  varies  between  three  to  six  or  even  more.  The  joints  are  usually  of  two 
types,  the  revolute  joint  type  that  allows  the  links  to  rotate  around  the  joint, 
much  like  an  elbow,  or  the  prismatic  joint  type  that  allows  one  of  the  links  to 
slide  in  and  out.  Robot  arms  are  mostly  used  to  assemble  or  manipulate  parts 
of  an  object,  or  to  perform  tasks  like  welding  or  spraying.  To  do  this,  they 
must  be  able  to  move  from  one  position  to  another,  without  colliding  with  the 
environment,  the  object  they  are  operating  on,  or — an  interesting  complication — 
with  themselves. 


In  this  chapter  we  introduce  some  of  the  basic  notions  and  techniques  used  in 
motion  planning.  The  general  motion  planning  problem  is  quite  difficult,  and 
we  shall  make  some  simplifying  assumptions. 

The  most  drastic  simplification  is  that  we  will  look  at  a  2-dimensional  motion 
planning  problem.  The  environment  will  be  a  planar  region  with  polygonal 
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Chapter  13  obstacles,  and  the  robot  itself  will  also  be  polygonal.  We  also  assume  that  the 
robot  motion  planning  environment  is  static — there  are  no  people  walking  in  the  way  of  our  robot — and 

known  to  the  robot.  The  restriction  to  planar  robots  is  not  as  severe  as  it  appears 
at  first  sight:  for  a  robot  moving  around  on  a  work  floor,  a  floor  plan  showing 
the  location  of  walls,  tables,  machines,  and  so  on,  is  often  sufficient  to  plan  a 
motion. 

The  types  of  motions  a  robot  can  execute  depend  on  its  mechanics.  Some 
robots  can  move  in  any  direction,  while  others  are  constrained  in  their  motions. 
Car-like  robots ,  for  instance,  cannot  move  sideways — otherwise  parallel  parking 
would  be  less  challenging.  In  addition,  they  often  have  a  certain  minimum  turn¬ 
ing  radius.  The  geometry  of  the  motions  of  car-like  robots  is  quite  complicated, 
so  we  will  restrict  ourselves  to  robots  that  can  move  in  arbitrary  directions. 
In  fact,  we  will  mainly  look  at  robots  that  can  translate  only;  at  the  end  of 
the  chapter  weTl  briefly  consider  the  case  of  robots  that  can  also  change  their 
orientation  by  rotation. 


13.1  Work  Space  and  Configuration  Space 

Let  3?  be  a  robot  moving  around  in  a  2-dimensional  environment,  or  work 
space ,  consisting  of  a  set  S  =  {CPi , . . . ,  CPf }  of  obstacles.  We  assume  that  Jv 
is  a  simple  polygon.  A  placement ,  or  configuration ,  of  the  robot  can  now  be 
specified  by  a  translation  vector.  We  denote  the  robot  translated  over  a  vector 
(x,y)  by  [R(A,y).  For  instance,  if  the  robot  is  the  polygon  with  vertices  (1,-1), 
(1,1),  (0,3),  (—1,1),  and  (-1,-1),  then  the  vertices  of  fR(6, 4)  are  (7,3),  (7,5), 
(6,7),  (5,5),  and  (5,3).  With  this  notation,  a  robot  can  be  specified  by  listing 
the  vertices  of  fR(0, 0). 

An  alternative  way  to  view  this  is  in  terms  of  a  reference  point.  This  is  most 
intuitive  if  the  origin  (0,0)  lies  in  the  interior  of  3?(0,0).  By  definition,  this 
point  is  then  called  the  reference  point  of  the  robot.  We  can  specify  a  placement 
of  Jv  by  simply  stating  the  coordinates  of  the  reference  point  if  the  robot  is 
in  the  given  placement.  Thus  ^R(x,y)  specifies  that  the  robot  is  placed  with 
its  reference  point  at  (x,y).  In  general,  the  reference  point  does  not  have  to 
be  inside  the  robot;  it  can  also  be  a  point  outside  the  robot,  which  we  might 
imagine  to  be  attached  to  the  robot  by  an  invisible  stick.  By  definition,  this  point 
is  at  the  origin  for  !R(0,0). 

Now  suppose  the  robot  can  change  its  orientation  by  rotation,  say  around  its 
reference  point.  We  then  need  an  extra  parameter,  (j),  to  specify  the  orientation 
of  the  robot.  We  let  D?(x,y,  (j))  denote  the  robot  with  its  reference  point  at  (x,y) 
and  rotated  counterclockwise  through  an  angle  (j).  So  what  is  specified  initially 
is  J?(0,0,0). 

In  general,  a  placement  of  a  robot  is  specified  by  a  number  of  parameters  that 
corresponds  to  the  number  of  degrees  of  freedom  (DOF)  of  the  robot.  This 
number  is  two  for  planar  robots  that  can  only  translate,  and  it  is  three  for  planar 
robots  that  can  rotate  as  well  as  translate.  The  number  of  parameters  we  need 


284 


Section  13.1 

WORK  SPACE  AND  CONFIGURATION 
SPACE 


placement  01  (p)  of  the  robot  in  the  work  space. 

In  the  example  of  a  translating  and  rotating  robot  in  the  plane  the  config¬ 
uration  space  is  3-dimensional.  A  point  (x,y,(j>)  in  this  space  corresponds  to 
the  placement  Ot(x,y,(j))  in  the  work  space.  The  configuration  space  is  not  the 
Euclidean  3-dimensional  space;  it  is  the  space  R2  x  [0 :  360).  Because  rotations 
over  zero  and  360  degrees  are  equivalent,  the  configuration  space  of  a  rotating 
robot  has  a  special  topology,  which  is  like  a  cylinder. 

The  configuration  space  of  a  translating  robot  in  the  plane  is  the  2-dimen- 
sional  Euclidean  plane,  and  therefore  identical  to  the  work  space.  Still,  it  is 
useful  to  distinguish  the  two  notions:  the  work  space  is  the  space  where  the 
robot  actually  moves  around — the  real  world,  so  to  speak — and  the  configuration 
space  is  the  parameter  space  of  the  robot.  A  polygonal  robot  in  the  work  space 
is  represented  by  a  point  in  configuration  space,  and  any  point  in  configuration 
space  corresponds  to  some  placement  of  an  actual  robot  in  work  space. 

We  now  have  a  way  to  specify  a  placement  of  the  robot,  namely  by  specifying 
values  for  the  parameters  determining  the  placement  or,  in  other  words,  by 
specifying  a  point  in  configuration  space.  But  clearly  not  all  points  in  con¬ 
figuration  space  are  possible;  points  corresponding  to  placements  where  the 
robot  intersects  one  of  the  obstacles  in  5  are  forbidden.  We  call  the  part  of 
the  configuration  space  consisting  of  these  points  the  forbidden  configuration 
space ,  or  forbidden  space  for  short.  It  is  denoted  by  Cforb(!R,S).  The  rest  of  the 
configuration  space,  which  consists  of  the  points  corresponding  to  free  place¬ 
ments — placements  where  the  robot  does  not  intersect  any  obstacle — is  called 
the  free  configuration  space,  or  free  space,  and  it  is  denoted  by  Cfree(D?,S). 

A  path  for  the  robot  maps  to  a  curve  in  the  configuration  space,  and  vice 
versa:  every  placement  along  the  path  simply  maps  to  the  corresponding  point 
in  configuration  space.  A  collision-free  path  maps  to  a  curve  in  the  free  space. 
Figure  13.1  illustrates  this  for  a  translating  planar  robot.  On  the  left  the  work 
space  is  shown,  with  a  collision-free  path  from  the  initial  position  to  the  goal 
position  of  the  robot.  On  the  right  the  configuration  space  is  shown,  with  the 
grey  area  indicating  the  forbidden  part  of  it.  The  unshaded  area  in  between 
the  grey  area  is  the  free  space.  For  clarity,  the  obstacles  are  still  shown  in 
the  configuration  space,  although  they  have  no  meaning  there.  The  curve 
corresponding  to  the  collision-free  path  is  also  shown. 

We  have  seen  how  to  map  placements  of  the  robot  to  points  in  the  configu¬ 
ration  space,  and  paths  of  the  robot  to  curves  in  that  space.  Can  we  also  map 
obstacles  to  configuration  space?  The  answer  is  yes:  an  obstacle  CP  is  mapped 
to  the  set  of  points  p  in  configuration  space  such  that  'Jl(p)  intersects  CP.  The 
resulting  set  is  called  the  configuration-space  obstacle,  or  6 -obstacle  for  short, 
of  CP. 


for  a  robot  in  3-dimensional  space  is  higher,  of  course:  a  translating  robot  in  R3 
has  three  degrees  of  freedom,  and  a  robot  that  is  free  to  translate  and  rotate  in 
R3  has  six  degrees  of  freedom. 

The  parameter  space  of  a  robot  CR  is  usually  called  its  configuration  space.  It  is 
denoted  by  C(CR).  A  point  p  in  this  configuration  space  corresponds  to  a  certain 
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Figure  13.1 
A  path  in  the  work  space  and  the 
corresponding  curve  in  the 
configuration  space 
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work  space 


configuration  space 


C-obstacles  may  overlap  even  when  the  obstacles  in  the  work  space  are 
disjoint.  This  happens  when  there  are  placements  of  the  robot  where  it  intersects 
more  than  one  obstacle  at  the  same  time. 

There  is  one  subtle  issue  that  we  have  ignored  so  far:  does  the  robot  collide 
with  an  obstacle  when  it  touches  that  obstacle?  In  other  words,  do  we  define 
the  obstacles  to  be  topologically  open  or  closed  sets?  In  the  remainder  we  will 
choose  the  first  option:  obstacles  are  open  sets,  so  that  the  robot  is  allowed  to 
touch  them.  This  is  of  little  importance  in  this  chapter,  but  it  will  become  useful 
in  Chapter  15.  In  practice  a  movement  where  the  robot  passes  very  close  to  an 
obstacle  cannot  be  considered  safe  because  of  possible  errors  in  robot  control. 
Such  movements  can  be  avoided  by  slightly  enlarging  all  the  obstacles  before 
the  computation  of  a  path. 


13.2  A  Point  Robot 

Before  we  try  to  plan  the  motion  of  a  polygonal  robot  in  the  plane,  let’s  have  a 
look  at  point  robots.  Given  the  mapping  from  work  space  to  configuration  space 
that  we  saw  in  the  previous  section  this  is  not  such  a  strange  idea.  Furthermore, 
it’s  always  good  to  start  with  a  simple  case.  As  before,  we  denote  the  robot 
by  tk  and  we  denote  the  obstacles  by  !P  | . . . . ,  'P, .  The  obstacles  are  polygons 
with  disjoint  interiors,  whose  total  number  of  vertices  is  denoted  by  n.  For  a 
point  robot,  the  work  space  and  the  configuration  space  are  identical.  (That  is  to 
say,  if  we  make  the  natural  assumption  that  its  reference  point  is  the  point  robot 
itself.  Otherwise  the  configuration  space  is  a  translated  copy  of  the  work  space.) 

Rather  than  finding  a  path  from  a  particular  start  position  to  a  particular  goal 
position  we  will  construct  a  data  structure  storing  a  representation  of  the  free 
space.  This  data  structure  can  then  be  used  to  compute  a  path  between  any  two 
given  start  and  goal  positions.  Such  an  approach  is  useful  if  the  work  space  of 
the  robot  does  not  change  and  many  paths  have  to  be  computed. 

To  simplify  the  description  we  restrict  the  motion  of  the  robot  to  a  large  bounding 
box  B  that  contains  the  set  of  polygons.  In  other  words,  we  add  one  extra 


infinitely  large  obstacle,  which  is  the  area  outside  B.  The  free  configuration  Section  13.2 
space  Cfree  now  consists  of  the  part  of  B  not  covered  by  any  obstacle:  a  point  robot 


Cfree  =  B  \  (J  3V 
i=l 

The  free  space  is  a  possibly  disconnected  region,  which  may  have  holes.  Our 
goal  is  to  compute  a  representation  of  the  free  space  that  allows  us  to  find  a 
path  for  any  start  and  goal  position.  We  will  use  the  trapezoidal  map  for  this. 
Recall  from  Chapter  6  that  the  trapezoidal  map  of  a  set  of  non-intersecting  line 
segments  inside  a  bounding  box  is  obtained  by  drawing  two  vertical  extensions 
from  every  segment  endpoint,  one  going  upward  until  a  segment  (or  the  bound¬ 
ing  box)  is  hit,  and  one  going  downward  until  a  segment  (or  the  bounding  box) 
is  hit.  In  Chapter  6  we  developed  a  randomized  algorithm,  TrapezoidalMap, 
that  computes  the  trapezoidal  map  of  a  set  of  n  segments  in  O(n\ogn)  expected 
time.  The  following  algorithm,  which  computes  a  representation  of  the  free 
space,  uses  this  algorithm  as  a  subroutine. 

Algorithm  ComputeFreeSpace(S) 

Input.  A  set  S  of  disjoint  polygons. 

Output.  A  trapezoidal  map  of  Cfree(^R,S)  for  a  point  robot  fR. 

1 .  Let  E  be  the  set  of  edges  of  the  polygons  in  S. 

2.  Compute  the  trapezoidal  map  7(E)  with  algorithm  TrapezoidalMap 
described  in  Chapter  6. 

3.  Remove  the  trapezoids  that  lie  inside  one  of  the  polygons  from  7(E)  and 
return  the  resulting  subdivision. 


vertical 

extensions 


The  algorithm  is  illustrated  in  Figure  13.2.  Part  (a)  of  the  figure  shows  the 
trapezoidal  map  of  the  obstacle  edges  inside  the  bounding  box;  this  is  what  is 
computed  in  line  2  of  the  algorithm.  Part  (b)  shows  the  map  after  the  trapezoids 
inside  the  obstacles  have  been  removed  in  line  3. 


Figure  13.2 

Computing  a  trapezoidal  map  of  the 
free  space 


There  is  one  detail  left:  how  do  we  find  the  trapezoids  inside  the  obstacles, 
which  have  to  be  removed?  This  is  not  so  difficult,  because  after  running 
TrapezoidalMap  we  know  for  each  trapezoid  the  edge  that  bounds  it  from 
the  top,  and  we  known  to  which  obstacle  that  edge  belongs.  To  decide  whether 

or  not  to  remove  the  trapezoid,  it  suffices  to  check  whether  the  edge  bounds  _ 

the  obstacle  from  above  or  from  below.  The  latter  test  takes  only  constant  time,  287 


Chapter  13  because  the  edges  of  the  obstacles  are  listed  in  order  along  the  boundary  so  that 
robot  motion  planning  the  obstacle  lies  to  a  specific,  known  side  of  the  edges. 

The  expected  time  taken  by  TrapezoidalMap  is  O(n\ogn),  so  we  get  the 
following  result. 

Lemma  13.1  A  trapezoidal  map  of  the  free  configuration  space  for  a  point  robot 
moving  among  a  set  of  disjoint  polygonal  obstacles  with  n  edges  in  total  can  be 
computed  by  a  randomized  algorithm  in  0(n  log/;)  expected  time. 

In  what  follows,  we  will  denote  the  trapezoidal  map  of  the  free  space  by 

T(efree). 

How  do  we  use  T(Cfree)  to  find  a  path  from  a  start  position  /?SLart  to  a  goal 
position  g0ai? 

If  Pstart  and  Pgoal  are  in  the  same  trapezoid  of  the  map,  this  is  easy:  the  robot 
can  simply  move  to  its  goal  in  a  straight  line. 

If  the  start  and  goal  position  are  in  different  trapezoids,  however,  then  things 
are  not  so  easy.  In  this  case  the  path  will  cross  a  number  of  trapezoids  and  it 
may  have  to  make  turns  in  some  of  them.  To  guide  the  motion  across  trapezoids 
we  construct  a  road  map  through  the  free  space.  The  road  map  is  a  graph  Sroad. 
which  is  embedded  in  the  plane.  More  precisely,  it  is  embedded  in  the  free 
space.  Except  for  an  initial  and  final  portion,  paths  will  always  follow  the  road 
map.  Notice  that  any  two  neighboring  trapezoids  share  a  vertical  edge  that  is  a 
vertical  extension  of  a  segment  endpoint.  This  leads  us  to  define  the  road  map 
as  follows.  We  place  one  node  in  the  center  of  each  trapezoid,  and  we  place 
one  node  in  the  middle  of  each  vertical  extension.  There  is  an  arc  between  two 


Figure  13.3 
A  road  map 

nodes  if  and  only  if  one  node  is  in  the  center  of  a  trapezoid  and  the  other  node 
is  on  the  boundary  of  that  same  trapezoid.  The  arcs  are  embedded  in  the  plane 
as  straight  line  segments,  so  following  an  arc  in  the  road  map  corresponds  to 
a  straight-line  motion  of  the  robot.  Figure  13.3  illustrates  this.  The  road  map 
Sroad  can  be  constructed  in  0(n)  time  by  traversing  the  doubly-connected  edge 
list  of  T(Cfree)-  Using  the  arcs  in  the  road  map  we  can  go  from  the  node  in  the 
center  of  one  trapezoid  to  the  node  in  the  center  of  a  neighboring  trapezoid  via 
the  node  on  their  common  boundary. 
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We  can  use  the  road  map,  together  with  the  trapezoidal  map,  to  plan  a  motion 
from  a  start  to  a  goal  position.  To  this  end  we  first  determine  the  trapezoids 
Astart  and  Agoai  containing  these  points.  If  they  are  the  same  trapezoid,  then  we 
move  from  pstart to  Pgoai  in  a  straight  line.  Otherwise,  let  vstart  and  vgoai  be  the 
nodes  of  Sroad  that  have  been  placed  in  the  center  of  these  trapezoids.  The  path 
from  pstart  to  pgoai  that  we  will  construct  now  consists  of  three  parts:  the  first 
part  is  a  straight-line  motion  from  pstart  to  vstart-  the  second  part  is  a  path  from 
Vstart  to  Vgoai  along  the  arcs  of  the  road  map,  and  the  final  part  is  a  straight-line 
motion  from  vgoai  to  pgoai-  Figure  13.4  illustrates  this. 


The  following  algorithm  summarizes  how  a  path  is  found. 

Algorithm  COMPUTEPATH(T( Cfree) ,  Sroad > Pstart > Pgoai) 

Input.  The  trapezoidal  map  T(Cfree)  of  the  free  space,  the  road  map  Sroad-  a  start 
position  pstart-  and  goal  position  pgoai- 

Output.  A  path  from  pstart  to  pgoai  if  it  exists.  If  a  path  does  not  exist,  this  fact  is 
reported. 

1 .  Find  the  trapezoid  Astart  containing  pstart  and  the  trapezoid  Asoai  containing 

Pgoai  • 

2.  if  Astart  or  Ag0ai  does  n°t  exist 

3.  then  Report  that  the  start  or  goal  position  is  in  the  forbidden  space. 

4.  else  Let  vstart  be  the  node  of  Sroad  in  the  center  of  Astart- 

5.  Let  Vg0ai  be  the  node  of  Sroad  in  the  center  of  Agoai. 

6.  Compute  a  path  in  Sroad  from  vstart  to  vgoai  using  breadth-first 
search. 

7.  if  there  is  no  such  path 

8.  then  Report  that  there  is  no  path  from  psta rt  to  pgoa i- 

9.  else  Report  the  path  consisting  of  a  straight-line  motion  from 

Pstart  to  Vstart-  the  path  found  in  Sroad-  and  a  straight-line 
motion  from  vgoai  to  pgoai  • 


Section  13.2 

A  POINT  ROBOT 


Figure  13.4 

A  path  computed  from  the  road  map  of 
Figure  13.3 


Before  we  analyze  the  time  complexity  of  algorithm,  let’s  think  about  its  cor¬ 
rectness.  Are  the  paths  we  report  always  collision-free,  and  do  we  always  find  a 
collision-free  path  if  one  exists? 
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The  first  question  is  easy  to  answer:  any  path  we  report  must  be  collision- 
free,  since  it  consists  of  segments  inside  trapezoids  and  all  trapezoids  are  in  the 
free  space. 

To  answer  the  second  question,  suppose  that  there  is  a  collision-free  path 
from  pstart  to  /?goai .  Obviously  pstart  and  Pgoai  must  lie  in  one  of  the  trapezoids 
covering  the  free  space,  so  it  remains  to  show  that  there  is  a  path  in  Sroad  from 
Vstart  to  Vgoai .  The  path  from  psta rt  to  pg0ai  must  cross  a  sequence  of  trapezoids. 
Denote  the  sequence  of  trapezoids  by  Ai,  Aj, . . .  ,A*.  By  definition,  Ai  =  Astart 
and  A*  =  Agoai.  Let  v )  be  the  node  of  Sroad  that  is  in  the  center  of  A,-.  If  the  path 
goes  from  A,-  to  A(+ 1 ,  then  A,-  and  A,-+ 1  must  be  neighbors,  so  they  share  a  vertical 
extension.  But  Sroad  is  constructed  such  that  the  nodes  of  such  trapezoids  are 
connected  through  the  node  on  their  common  boundary.  Hence,  there  is  a  path 
(consisting  of  two  arcs)  in  Sroad  from  v,-  to  V,+i.  This  means  that  there  is  a  path 
from  Vi  to  Vk  as  well.  It  follows  that  the  breadth-first  search  in  Sroad  will  find 
some  (possibly  different)  path  from  vstart  to  vgoai. 

We  now  analyze  the  time  the  algorithm  takes. 

Finding  the  trapezoids  containing  the  start  and  goal  can  be  done  in  O(\ogn) 
using  the  point  location  structure  of  Chapter  6.  Alternatively,  we  can  simply 
check  all  trapezoids  in  linear  time;  we  shall  see  that  the  rest  of  the  algorithm 
takes  linear  time  anyway,  so  this  does  not  increase  the  time  bound  asymptoti¬ 
cally. 

The  breadth-first  search  takes  linear  time  in  the  size  of  the  graph  Sroad-  This 
graph  has  one  node  per  trapezoid  plus  one  node  per  vertical  extension.  Both 
the  number  of  vertical  extensions  and  the  number  of  trapezoids  are  linear  in  the 
total  number  of  vertices  of  the  obstacles.  The  number  of  arcs  in  the  graph  is 
linear  as  well,  because  it  is  planar.  Hence,  the  breadth-first  search  takes  0(n) 
time. 

The  time  to  report  the  path  is  bounded  by  the  maximum  number  of  arcs  on 
a  path  in  Sroad.  which  is  0(n). 

We  get  the  following  theorem. 

Theorem  13.2  Let  3?  be  a  point  robot  moving  among  a  set  S  of  polygonal 
obstacles  with  n  edges  in  total.  We  can  preprocess  S  in  Oin  log/;)  expected  time, 
such  that  between  any  start  and  goal  position  a  collision-free  path  for  Jv  can  be 
computed  in  O(n)  time,  if  it  exists. 

The  path  computed  by  the  algorithm  of  this  section  is  collision-free,  but  we 
can  give  no  guarantee  that  the  path  does  not  make  large  detours.  In  Chapter  15 
we  will  develop  an  algorithm  that  actually  computes  the  shortest  possible  path. 
That  algorithm,  however,  will  be  slower  by  an  order  of  magnitude. 


13.3  Minkowski  Sums 

_  In  the  previous  section  we  solved  the  motion  planning  problem  for  a  point 

290  robot;  we  computed  a  trapezoidal  map  of  its  free  space  and  used  that  map  to 


plan  its  motions.  The  same  approach  can  be  used  if  the  robot  is  a  polygon.  Section  13.3 

There  is  one  difference  that  makes  dealing  with  a  polygonal  robot  more  difficult:  minkowski  sums 

the  configuration-space  obstacles  are  no  longer  the  same  as  the  obstacles  in 
work  space.  Therefore  we  start  by  studying  the  free  configuration  space  of  a 
translating  polygonal  robot.  In  the  next  section  we  will  then  describe  how  to 
compute  it,  so  that  we  can  use  it  to  plan  the  motion  of  the  robot. 

We  assume  that  the  robot  3?  is  convex,  and  for  the  moment  we  also  assume  that 
the  obstacles  are  convex.  Recall  that  we  use  CP(x,y)  to  denote  the  placement 
of  3?  with  its  reference  point  at  (x,y).  The  configuration- space  obstacle,  or 
C-obstacle,  of  an  obstacle  CP  and  the  robot  3?  is  defined  as  the  set  of  points  in 
configuration  space  such  that  the  corresponding  placement  of  3?  intersects  3’. 

So  if  we  denote  the  C-obstacle  of  CP  by  CCP,  then  we  have 

CCP  :=  {(x,y)  :  CP(x,y)  D CP  ^  0}. 


You  can  visualize  the  shape  of  CCP  by  sliding  CP  along  the  boundary  of  CP;  the 
curve  traced  by  the  reference  point  of  CP  is  the  boundary  of  CCP. 

We  can  describe  this  in  a  different  way  using  the  notion  of  Minkowski  sums. 
The  Minkowski  sum  of  two  sets  Si  C  K2  and  Si  C  R2,  denoted  by  .S']  ®  Si.  is 
defined  as 

Si ©S2  :=  {p  +  g  :  p€Si,q€S2}, 

where  p  +  q  denotes  the  vector  sum  of  the  vectors  p  and  q ,  that  is,  if  p  =  (p.x,py) 
and  q  =  ( qx,qy )  then  we  have 

p +q-=  ( px+qx,Py+qy )■ 

Because  a  polygon  is  a  planar  set  the  definition  of  Minkowski  sums  also  applies 
to  them. 

To  be  able  to  express  the  C-obstacles  as  Minkowski  sums,  we  need  one 
more  piece  of  notation.  For  a  point  p  =  ( px,Py )  we  define  — p  :=  (— px,  —  p_v), 
and  for  a  set  S  we  define  —  S  :=  {— p  :  p  G  S}.  In  other  words,  we  get  —  S  by 
reflecting  S  about  the  origin.  We  now  have  the  following  theorem. 


Theorem  13.3  Let  CP  be  a  planar,  translating  robot  and  let  CP  be  an  obstacle. 
Then  the  C -obstacle  of  CP  is  CP©  (— CP(0,0)). 

Proof.  We  have  to  prove  that  CPfv.y)  intersects  CP  if  and  only  if  we  have  that 

{x,y)  G  CP©(— CP(0,0)). 

First,  suppose  that  CP(x,y)  intersects  CP,  and  let  q  =  ( qx,qy )  be  a  point  in  the 
intersection.  It  follows  from  q  G  3^(x,y)  that  we  have  (qx  —  x, qy  —  y)  G  CP(0, 0) 
or,  equivalently,  that  (— qx  +  x,  —  qy+y)  G  —  CP(0,0).  Because  we  also  have 
q  G  CP,  this  implies  that  (x,y)  G  CP©  (— CP(0,0)). 

Conversely,  let  ( x,y )  G  CP©  ( — CP(0,0)).  Then  there  are  points  (rx, ry)  G 
CR(0,0)  and  (pXlpy)  G  CP  such  that  (x,y)  =  (px  —  rXlpy  —  ry)  or,  in  other  words, 
such  that  px  =  rx +x  and  pv  =  ry  +y,  which  implies  that  CP(x,  y)  intersects  CP.  0 
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Figure  13.5 

extreme  point  on  a  Minkowski  sum 
is  the  sum  of  extreme  points 
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So  for  a  planar  translating  robot  3  the  C-obstacles  are  the  Minkowski  sums 
of  the  obstacles  and  —  3(0,0).  (Sometimes  IP®  (—31(0,0))  is  referred  to  as 
the  Minkowski  difference  of  3*  and  3(0,0).  Since  Minkowski  differences  are 
defined  differently  in  the  mathematics  literature  we  shall  avoid  this  term.) 

In  the  remainder  of  this  section  we  will  derive  some  useful  properties  of 
Minkowski  sums  and  develop  an  algorithm  to  compute  them. 

We  start  with  a  simple  observation  about  extreme  points  on  Minkowski  sums. 

Observation  13.4  Let  7  and  3?  be  two  objects  in  the  plane,  and  let  CLP  :=  3®  3. 
An  extreme  point  in  direction  d  on  63  is  the  sum  of  extreme  points  in  direction  d 
on  7  and  7. 

Figure  13.5  illustrates  the  observation.  Using  this  observation  we  now  prove 
that  the  Minkowski  sum  of  two  convex  polygons  has  linear  complexity. 


Theorem  13.5  Let  7  and  7  be  two  convex  polygons  with  n  and  m  edges,  re¬ 
spectively.  Then  the  Minkowski  sum  7®  7  is  a  convex  polygon  with  at  most 
n  +  m  edges. 

Proof.  The  convexity  of  the  Minkowski  sum  of  two  convex  sets  follows  directly 
from  the  definition. 

To  see  that  the  complexity  of  the  Minkowski  sum  is  linear,  consider  an  edge 
e  of  IP  ®  3.  This  edge  is  extreme  in  the  direction  of  its  outer  normal.  Hence,  it 
must  be  generated  by  points  on  7  and  3  that  are  extreme  in  the  same  direction. 
Moreover,  at  least  one  of  7  and  7  must  have  an  edge  that  is  extreme  in  that 
direction.  We  charge  e  to  this  edge.  This  way  each  edge  is  charged  at  most  once, 
so  the  total  number  of  edges  is  at  most  n  +  m.  (If  7  and  3  don’t  have  parallel 
edges,  then  the  number  of  edges  of  the  Minkowski  sum  is  exactly  n  +  m.)  Li 

So  the  Minkowski  sum  of  two  convex  polygons  is  convex  and  has  linear 
complexity.  But  there  is  more:  the  boundaries  of  two  Minkowski  sums  can  only 
intersect  in  a  very  special  manner.  To  make  this  precise,  we  need  one  more 
piece  of  terminology. 

Consider  two  planar  objects  o \  and  02,  each  bounded  by  a  simple  closed 
curve.  Intuitively,  the  pair  01 , 02  is  called  a  pair  of  pseudodiscs  if  their  bound¬ 
aries  do\  and  do2  intersect  in  at  most  two  points.  Figure  13.6  illustrates  this. 
In  degenerate  situations — when  the  boundaries  have  a  1 -dimensional  overlap 


for  instance — this  definition  is  not  quite  general  enough.  Therefore  we  for-  Section  13.3 
mally  define  o\ ,  02  to  be  a  pair  of  pseudodiscs  if  the  following  condition  holds:  minkowski  sums 
do\  nint(c>2)  is  connected  and  do2  flint(oi)  is  connected.  (Here  int(o)  denotes 
the  interior  of  an  object  o.)  A  collection  of  objects,  each  bounded  by  a  simple 
closed  curve,  is  called  a  collection  of  pseudodiscs  if  every  pair  of  objects  in  the 
collection  is  a  pair  of  pseudodiscs.  Examples  of  collections  of  pseudodiscs  are 
collections  of  discs,  and  collections  of  axis-parallel  squares.  Note  that  the  pseu¬ 
dodisc  property  is  about  the  way  in  which  (the  boundaries  of)  two  objects  can 
interact — it  does  not  make  sense  to  say  of  a  single  object  that  it  is  a  pseudodisc. 

pseudodiscs  not  pseudodiscs 


Figure  13.6 

The  pseudodisc  property 


Now  consider  two  polygons  (P  and  V .  We  say  that  an  intersection  point 
p  G  ckPndy'  is  a  boundary  crossing  if  <9(P  crosses  from  the  interior  of  T'  to 
the  exterior  of  3J/  at  p.  Polygonal  pseudodiscs  satisfy  the  following  important 
property: 

Observation  13.6  A  pair  of  polygonal  pseudodiscs  CP,  CP'  defines  at  most  two 
boundary  crossings. 


Below  we  will  prove  that  a  collection  of  Minkowski  sums  forms  a  collection 
of  pseudodiscs.  But  first  we  need  one  more  observation  about  directions  and 
extreme  points  on  pairs  of  convex  polygons  with  disjoint  interiors.  We  say 
that  one  polygon  is  more  extreme  in  a  direction  d  than  another  polygon  if  its 
extreme  points  lie  further  in  that  direction  than  the  extreme  points  of  the  other 
polygon.  For  instance,  a  polygon  is  more  extreme  in  the  positive  x-di  recti  on  if 
its  rightmost  points  lie  to  the  right  of  the  rightmost  points  of  the  other  polygon. 

We  will  look  at  extreme  points  for  various  directions.  To  this  end  we  model 
the  set  of  all  directions  by  the  unit  circle  centered  at  the  origin:  a  point  p  on 
the  unit  circle  represents  the  direction  given  by  the  vector  from  the  origin  to  p. 
The  range  from  a  direction  d\  to  a  direction  c/2  is  defined  as  the  directions 
corresponding  to  points  in  the  counterclockwise  circle  segment  from  the  point 
representing  d\  to  the  point  representing  ob.  Note  that  the  range  from  d\  to  cb  is 
not  the  same  as  the  range  from  cb  to  d\.  The  following  observation  is  illustrated 
in  Figure  13.7. 


Observation  13.7  Let  (Pi  and  3b  be  convex  polygons  with  disjoint  interiors. 
Letd\  and  cb  be  directions  in  which  (P  |  is  more  extreme  than  3b-  Then  either 
(P 1  is  more  extreme  than  3b  in  all  directions  in  the  range  from  d\  to  cb,  or  it  is 
more  extreme  in  all  directions  in  the  range  from  c/2  to  d  1 . 


We  are  now  ready  to  prove  that  Minkowski  sums  are  pseudodiscs. 
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Figure  13.7 

One  convex  polygon  is  more  extreme 
than  another  for  a  connected  range  of 
directions 
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Theorem  13.8  Let  IP  |  and  JS  be  two  convex  polygons  with  disjoint  interiors, 
and  let  01  be  another  convex  polygon.  Then  the  two  Minkowski  sums  IP]  ©  IP 
and  ©  CP  are  pseudodiscs. 

Proof.  Define  CCPi  :=  CPi  ©  3?  and  C CP2  :=  CP2  ©  CP.  By  symmetry,  it  suffices  to 
show  that  <9CCPi  fl  in^CJS)  is  connected. 

Suppose  <9  CCPi  fl  intfCPIPj  is  not  connected.  Then  there  are  four  alternat¬ 
ing  directions  dp,dq,dr,ds — these  directions  are  outward  normals  of  points 
p,q,r,s  £  <9 CCPi  that  occur  in  the  given  order  along  <9  CCPi  with  p,r  £  int(CCP2) 
and  q,s  int(CCP2) — such  that  CCP2  is  more  extreme  than  CCPi  in  directions  dp 
and  dr  while  this  is  not  the  case  for  dq  and  ds.  From  Observation  13.4  it  follows 
that  CPj  is  more  extreme  than  CP2  in  directions  dp  and  dr  and  not  more  extreme 
in  directions  dq  and  ds.  But  this  contradicts  Observation  13.7.  0 

This  result  is  useful  in  combination  with  the  following  theorem. 

Theorem  13.9  Let  S  be  a  collection  of  convex  polygonal  pseudodiscs  with  n 
edges  in  total.  Then  the  complexity  of  their  union  is  at  most  2 n. 

Proof.  We  prove  the  bound  by  charging  every  vertex  of  the  union  to  a  pseu¬ 
dodisc  vertex  in  such  a  way  that  any  pseudodisc  vertex  is  charged  at  most  two 
times.  This  leads  to  a  bound  of  2 n  on  the  complexity  of  the  union. 

The  charging  is  done  as  follows.  There  are  two  types  of  vertices  in  the  union 
boundary:  pseudodisc  vertices  and  intersection  points  of  the  interiors  of  two 
pseudodisc  edges. 

Vertices  of  the  former  type  are  simply  charged  to  themselves. 

Now  consider  a  union  vertex  v  that  is  the  intersection  of  an  edge  e  of  a 
pseudodisc  CP  €  S  and  an  edge  e'  of  a  pseudodisc  CP'  £  S.  Then  e  fl  e'  is  a 
boundary  crossing.  Hence,  by  Observation  13.6  either  e  does  not  have  another 
boundary  crossing  with  d  CP',  or  e'  does  not  have  another  boundary  crossing  with 
9CP  (or  both).  Assume  without  loss  of  generality  that  e  does  not  have  another 
crossing  with  <9CP'.  Starting  at  eC\e' ,  follow  e  into  the  interior  of  CP'.  Because  e 
does  not  cross  d  CP'  a  second  time,  we  must  reach  an  endpoint  of  e  before  we 
reach  the  exterior  of  CP'.  We  charge  v  to  this  endpoint  of  e. 

We  claim  that  if  we  do  the  charging  in  this  way,  then  every  pseudodisc 
vertex  v  of  any  pseudodisc  CP  is  charged  at  most  twice. 

First  consider  the  case  that  v  does  not  lie  in  the  interior  or  on  the  boundary 
of  any  other  pseudodisc  CP'.  Then  obviously  v  is  a  union  vertex,  and  it  is  charged 
only  by  itself.  Next  consider  the  case  where  v  lies  in  the  interior  of  some 


pseudodisc  7' .  This  means  that  v  lies  in  the  interior  of  the  union.  Now  follow  Section  13.3 

the  two  edges  of  CP  incident  to  v  until  the  union  boundary  is  reached  at  a  crossing  minkowski  sums 

with  some  other  edge;  these  two  crossings,  if  they  exist,  are  the  only  ones  that 

get  charged  to  v.  Finally,  if  v  lies  on  the  boundary  of  some  other  pseudodisc 

CP',  then  v  may  be  charged  by  itself  and  (similar  to  the  case  where  v  is  in  a 

pseudodisc  interior)  by  the  first  two  crossings  along  its  two  incident  edges.  It 

can  only  be  charged  from  both  its  incident  edges,  however,  when  the  incident 

edges  go  from  v  into  the  interior  of  the  union  of  the  pseudodiscs.  In  that  case  v 

is  not  a  union  vertex  and  it  will  not  be  charged  by  itself.  Hence,  in  all  cases  v  is 

charged  at  most  twice.  0 

The  proof  of  this  theorem  depends  heavily  on  the  pseudodiscs  being  polygo¬ 
nal,  but  the  theorem  itself  generalizes  to  arbitrary  pseudodiscs:  the  complexity 
of  the  union  of  any  set  of  pseudodiscs  is  linear  in  the  total  complexity  of  the 
pseudodiscs.  This  implies  for  instance  that  the  union  of  n  discs  in  the  plane  has 
0(n)  complexity.  In  fact,  the  following  stronger  result  holds:  the  number  of 
boundary  intersections  that  show  up  as  union  vertices  is  linear  in  the  number  of 
pseudodiscs.  This  more  general  theorem  is  a  lot  more  difficult  to  prove. 


Before  we  return  to  our  motion  planning  application,  we  give  an  algorithm  to 
compute  the  Minkowski  sum  of  two  convex  polygons  7  and  7.  A  very  simple 
algorithm  is  the  following.  For  each  pair  v,  w  of  vertices,  with  v  e  CP  and  w  £  7, 
compute  v  +  w.  Next,  compute  the  convex  hull  of  all  these  sums.  Unfortunately 
this  algorithm  inefficient  when  the  polygons  have  many  vertices,  because  it 
looks  at  every  pair  of  vertices.  Below  we  give  an  alternative  algorithm,  which  is 
as  easy  to  implement.  It  only  looks  at  pairs  of  vertices  that  are  extreme  in  the 
same  direction — this  is  allowed  because  of  Observation  13.4 — which  makes  it 
run  in  linear  time.  In  the  algorithm  we  use  the  notation  angle  (pq)  to  denote  the 
angle  that  the  vector  pq  makes  with  the  positive  x-axis. 

Algorithm  M I N  K  o  \v  s  K  [  S  i ;  M  ( 7 ,7) 

Input.  A  convex  polygon  7  with  vertices  vi, . . .  ,v„,  and  a  convex  polygon  3? 
with  vertices  w i , . . . ,  wm.  The  lists  of  vertices  are  assumed  to  be  in  counter¬ 
clockwise  order,  with  vi  and  w \  being  the  vertices  with  smallest  y-coordinate 
(and  smallest  x-coordinate  in  case  of  ties). 

Output.  The  Minkowski  sum  7  ©  (R. 

1.  ;«-i;;«-i 

2.  v„+i  4-  vi;  vn+2  <-  v2;  wm+ 1  <-  wp  wm+2  <-  w2 

3.  repeat 

4.  Add  \’i  +  xv  j  as  a  vertex  to  IP  ©  CP. 

5.  if  angle{v{Vi+i)  <  angle(wj\Vj+\) 

6.  then  /  4—  (i+1) 

7.  else  if  angle(vjVi+i)  >  angle(\VjWj+i) 

8.  then  j  <— (y  +  1) 

9.  else  i*-(i+l);j*-(j+l) 

10.  until  i  =  n  +  1  and  j  =  in  T  1 

MinkowskiSum  runs  in  linear  time,  because  at  each  execution  of  the  repeat 
loop  either  i  or  j  is  incremented  and — as  is  not  difficult  to  prove — they  will 
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Chapter  13  not  be  incremented  after  reaching  the  values  n  +  1  and  m  +  1 .  The  fact  that  the 

robot  motion  planning  correct  pairs  of  vertices  are  taken  is  similar  to  the  proof  of  Theorem  13.5;  one 

just  has  to  observe  that  any  vertex  of  the  Minkowski  sum  is  the  sum  of  two 
original  vertices  that  are  extreme  in  a  common  direction,  and  argue  that  the 
angle  test  ensures  that  all  extreme  pairs  are  found. 

We  conclude  with  the  following  theorem: 

Theorem  13.10  The  Minkowski  sum  of  two  convex  polygons  with  n  and  m 
vertices,  respectively,  can  be  computed  in  0(n  +  m)  time. 

What  happens  if  one  or  both  of  the  polygons  are  not  convex?  This  question 
is  not  so  hard  to  answer  if  we  realize  that  the  following  equality  holds  for  any 
sets  Si,  S2,  and  S3: 


Si  ©  (So  US3)  =  (Si  ©S2)  U  (Si  ©S3). 

Now  consider  the  Minkowski  sum  of  a  non-convex  polygon  IP  and  a  convex 
polygon  3^  with  n  and  m  vertices  respectively.  What  is  the  complexity  of  IP  ©  3?? 
We  know  from  Chapter  3  that  the  polygon  IP  can  be  triangulated  into  n  —  2 
triangles  1 1 , . . .  ,tn- 2,  where  n  is  its  number  of  vertices.  From  the  equality  above 
we  can  conclude  that 

n—2 

IP©  3?=  U  f,©3T 

i=l 

Since  f,  is  a  triangle  and  3?  a  convex  polygon  with  m  vertices,  we  know  that 
tj  ©  3?  is  a  convex  polygon  with  at  most  m  +  3  vertices.  Moreover,  the  triangles 
have  disjoint  interiors,  so  the  collection  of  Minkowski  sums  is  a  collection  of 
pseudodiscs.  Hence,  the  complexity  of  their  union  is  linear  in  the  sum  of  their 
complexities.  This  implies  that  the  complexity  of  IP©  3?  is  0(nm). 


Figure  13.8 

The  Minkowski  sum  of  a  non-convex 
and  a  convex  polygon 


This  upper  bound  on  the  complexity  of  a  non-convex  and  a  convex  polygon 
is  tight  in  the  worst  case.  To  see  this,  consider  a  polygon  IP  with  \  n/2 J  spikes 
pointing  upward,  and  a  much  smaller  polygon  3?  that  is  the  top  half  of  a  regular 
(2m  —  2)-gon.  The  Minkowski  sum  of  these  polygons  will  also  have  n/2 

_  spikes,  each  of  which  has  m  vertices  at  its  top.  Figure  13.8  illustrates  the 

296  construction. 


To  bound  the  complexity  of  the  Minkowski  sum  of  two  non-convex  polygons  CP  Section  13.4 

and  CR,  we  triangulate  both  polygons.  We  get  a  collection  of  n  —  2  triangles  translational  motion  planning 

and  a  collection  of  m  —  2  triangles  uj.  The  Minkowski  sum  of  CP  and  CR  is  now  the 

union  of  the  Minkowski  sums  of  the  pairs  tj.iij.  Each  sum  f,-  ©  uj  has  constant 

complexity.  Hence,  CP  ©  CR  is  the  union  of  (n  —  2)  (m  —  2)  constant-complexity 

polygons.  This  implies  that  the  total  complexity  of  CP© CR  is  0(n2m2).  Again, 

this  bound  is  tight  in  the  worst  case:  there  are  non-convex  polygons  whose 

Minkowski  sum  really  has  &(n2m2)  complexity.  Figure  13.9  illustrates  this. 


Figure  13.9 

The  Minkowski  sum  of  two  non-convex 
polygons 


The  following  theorem  summarizes  the  results  on  the  complexity  of  Minkowski 
sums.  For  completeness  the  complexity  in  the  case  of  two  convex  polygons  is 
given  as  well. 

Theorem  13.11  Let  CP  and  CR  be  polygons  with  n  and  m  vertices,  respectively. 
The  complexity  of  the  Minkowski  sum  CP  ©  CR  is  bounded  as  follows: 

(i)  it  is  0(n  +  m)  if  both  polygons  are  convex; 

(ii)  it  is  0(nm)  if  one  of  the  polygons  is  convex  and  one  is  non-convex; 

(Hi)  it  is  0(n2m 2)  if  both  polygons  are  non-convex. 

These  bounds  are  tight  in  the  worst  case. 

Computing  Minkowski  sums  of  non-convex  polygons  is  not  very  difficult: 
triangulate  both  polygons,  compute  the  Minkowski  sum  of  each  pair  of  triangles, 
and  take  their  union.  This  approach  is  basically  the  same  as  the  approach 
described  in  the  next  section  for  computing  the  forbidden  space  of  a  translating 
robot,  so  we  omit  the  details  here. 


13.4  Translational  Motion  Planning 

It  is  time  to  return  to  the  planar  motion  planning  problem.  Recall  that  our  robot 
CR  can  only  translate  and  that  the  obstacles  are  disjoint  polygons.  Early  in  the 
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previous  section  we  have  shown  that  the  C -obstacle  corresponding  to  an  obstacle 
IP,  is  the  Minkowski  sum  CP,-  ®  (—Ik).  Moreover,  we  have  seen  that  Minkowski 
sums  of  convex  polygons  are  pseudodiscs.  We  use  this  to  prove  our  first  major 
result  on  the  motion  planning  problem,  which  states  that  the  complexity  of  the 
free  space  of  a  translating  planar  robot  is  linear. 

Theorem  13.12  Let  CR  be  a  convex  robot  of  constant  complexity,  translating 
among  a  setS  of  non-intersecting  polygonal  obstacles  with  a  total  ofn  edges. 
Then  the  complexity  of  the  free  configuration  space  Cfree(lR,S)  is  0(n). 

Proof.  First  we  triangulate  each  obstacle  polygon.  We  get  a  set  of  0(n)  tri¬ 
angular,  and  hence  convex,  obstacles  with  disjoint  interiors.  The  free  con¬ 
figuration  space  is  the  complement  of  the  union  of  the  C-obstacles  of  these 
triangles.  Because  the  robot  has  constant  complexity,  the  C-obstacles  have  con¬ 
stant  complexity,  and  according  to  Theorem  13.8  they  form  a  set  of  pseudodiscs. 
Theorem  13.9  now  implies  that  the  union  has  linear  complexity.  0 

It  remains  to  find  an  algorithm  to  construct  the  free  space.  Rather  than 
computing  the  free  space  Cfree,  we  shall  compute  the  forbidden  space  CforC  the 
free  space  is  simply  its  complement. 

Let  CPi , .  -  - ,  CP„  denote  the  triangles  that  we  get  when  we  triangulate  the 
obstacles.  We  want  to  compute 

n  n 

Cforb  =  UG;Pt  =  UW-^0’0))' 

1=1  i=l 

In  Section  13.3  we  saw  how  to  compute  the  individual  Minkowski  sums  CCP 
To  compute  their  union  we  use  a  simple  divide-and-conquer  approach. 

Algorithm  ForbiddenSpace(CCPi  , . . . ,  CCP „) 

Input.  A  collection  of  C-obstacles. 

Output.  The  forbidden  space  Cforb  =  UiLi  CCP;. 

1 .  if  n  =  1 

2.  then  return  CCP  i 

3.  else  Cforb  ^ForbiddenSpace(?i  , . . . ,  CP|-„/2] ) 

4-  GLb  ^FORBIDDENSPACE(?r„/21  +  1,...,J>„) 

5 .  Compute  Cforb  =  C|orb  U  Cj?orb . 

6.  return  Cforb 

The  heart  of  this  algorithm  is  the  subroutine  to  compute  the  union  of  two  planar 
regions,  which  we  need  to  perform  the  merge  step  (line  5).  If  we  represent 
these  regions  by  doubly-connected  edge  lists,  this  can  be  done  by  the  overlay 
algorithm  described  in  Chapter  2. 

The  following  lemma  summarizes  the  result. 
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Lemma  13.13  The  free  configuration  space  Cfree  of  a  convex  robot  of  constant 
complexity  translating  among  a  set  of  polygons  with  n  edges  in  total  can  be 
computed  in  0(nlog2  n)  time. 


Proof.  In  Chapter  3  we  saw  that  a  polygon  with  m  vertices  can  be  triangulated  Section  13.5* 

in  O(mlogm)  time.  (In  fact,  it  can  even  be  done  in  0(m)  time  with  a  very  motion  planning  with  rotations 

complicated  algorithm,  as  stated  in  the  notes  and  comments  of  Chapter  3.) 

Hence,  if denotes  the  complexity  of  obstacle  CP,-,  then  triangulating  all  the 
obstacles  takes  time  proportional  to 

t  t 

Y,  m,  log  w,  ^  y  irij  \ogn  =  n  logn. 
i=l  i=  1 

Computing  the  C-obstacles  of  each  of  the  resulting  triangles  takes  linear  time  in 
total.  It  remains  to  bound  the  time  that  ForbiddenSpace  needs  to  compute 
the  union  of  the  C-obstacles. 

Using  the  results  from  Chapter  2,  the  merge  step  (line  5)  can  be  done  in 
0((m  +  m  +  k)  log(«i  +H2))  where  m,  112,  and  k  denote  the  complexity  of 
C'orb,  Cforb,  and  CforbUC2orb.  Theorem  13.12  states  that  the  complexity  of  the 
free  space — and.  hence,  of  the  forbidden  space — is  linear  in  the  sum  of  the 
complexities  of  the  obstacles.  In  our  case  this  means  that  n\,  112,  and  k  are 
all  0(n),  so  the  time  for  the  merge  step  is  0(n  log/;)-  We  get  the  following 
recurrence  for  T(n),  the  time  the  algorithm  needs  when  applied  to  a  set  of  n 
constant-complexity  C-obstacles: 

T(n)  =  T(\n/2-])  +  T{[n/2\)  +  0(n\ogn). 

The  solution  of  this  recurrence  is  O(nlog2n).  0 

The  result  of  this  theorem  is  not  the  best  possible — see  the  notes  and  com¬ 
ments  of  this  chapter. 


Now  that  we  have  computed  the  free  space,  we  can  continue  in  exactly  the 
same  way  as  in  Section  13.2:  we  compute  a  trapezoidal  map  of  the  free  space, 
together  with  a  roadmap.  Given  a  start  and  a  goal  placement  of  the  robot  1R, 
we  find  a  path  as  follows.  First,  we  map  the  start  and  goal  placement  to  points 
in  the  configuration  space.  Then  we  compute  a  path  between  these  two  points 
through  the  free  space  using  the  trapezoidal  map  and  the  road  map,  as  described 
in  Section  13.2.  Finally,  we  map  the  path  back  to  a  path  for  1R  in  the  work  space. 

The  next  theorem  summarizes  the  result  of  our  efforts. 


Theorem  13.14  Let  (R  be  a  convex  robot  of  constant  complexity  translating 
among  a  set  S  of  disjoint  polygonal  obstacles  with  11  edges  in  total.  We  can 
preprocess  S  in  0(n  log2  n)  expected  time,  such  that  between  any  start  and  goal 
position  a  collision-free  path  for  Jv,  if  it  exists,  can  be  computed  in  0(n)  time. 


13.5*  Motion  Planning  with  Rotations 

In  the  previous  sections  the  robot  was  only  allowed  to  translate.  When  the  robot 

is  circular  this  does  not  limit  its  possible  motion.  On  the  other  hand,  when  it  is  _ 

long  and  skinny,  translational  motion  is  often  not  enough:  it  may  have  to  change  299 
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its  orientation  to  be  able  to  pass  through  a  narrow  passage  or  to  go  around  a 
corner.  In  this  section  we  sketch  a  method  to  plan  motion  for  robots  that  can 
rotate  as  well  as  translate. 


Let  ft  be  a  convex  polygonal  robot  that  can  translate  and  rotate  in  a  planar  work 
space  that  contains  a  set  fti , . . . ,  ft,  of  disjoint  polygonal  obstacles.  The  robot 
ft  has  three  degrees  of  freedom:  two  translational  and  one  rotational  degree  of 
freedom.  Hence,  we  can  specify  a  placement  for  ft  by  three  parameters:  the 
x-  and  y-coordinate  of  a  reference  point  of  ft,  and  an  angle  (!)  that  specifies  its 
orientation.  As  in  Section  13.1,  we  use  ft(x,y,  0)  to  denote  the  robot  placed 
with  its  reference  point  at  ( x,y )  and  rotated  over  an  angle  0. 

The  configuration  space  that  we  get  is  the  3-dimensional  space  M2  x  [0 :  360), 
with  a  topology  where  points  (x,y,0)  and  (x,y,  360)  are  identified.  Recall  that 
Cft;,  the  C-obstacle  of  an  obstacle  ft,-,  is  defined  as  follows: 

eft;  :={(x,y,0)GK2x  [0:360)  :  ft(x,y,0)  ©ft;  ^  0}. 

What  do  these  C -obstacles  look  like?  This  question  is  difficult  to  answer  directly, 
but  we  can  get  an  idea  by  looking  at  cross-sections  with  planes  of  constant  0. 
In  such  a  plane  the  rotation  angle  is  fixed,  so  we  are  dealing  with  a  purely 
translational  problem.  Hence,  we  know  the  shape  of  the  cross-section:  it  is 
a  Minkowski  sum.  More  precisely,  the  cross-section  of  Cft,  with  the  plane 
h  :  0  =  0o  is  equal  to  ft,-  ©  ft(0,0, 0o).  (More  precisely,  it  is  a  copy  of  the 
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Figure  13.10 
The  C-obstacle  of  a  rotating  and 
translating  robot 


Minkowski  sum  placed  at  height  0 o.)  Now  imagine  sweeping  a  horizontal 
plane  upwards  through  configuration  space,  starting  at  0  =  0  and  ending  at 
0  =  360.  At  any  time  during  the  sweep  the  cross-section  of  the  plane  with  Cft; 
is  a  Minkowski  sum.  The  shape  of  the  Minkowski  sum  changes  continuously:  at 
0  =  0 o  the  cross-section  is  ft,;  ©  ft(0, 0, 0o),  and  at  0  =  0o  +  £  the  cross-section 
is  ft,-  ©  ft(0,0, 0o  +  e).  This  means  that  Cft,  looks  like  a  twisted  pillar,  as  in 

_  Figure  13.10.  The  edges  and  facets  of  this  twisted  pillar,  except  for  the  top  facet 

300  and  bottom  facet,  are  curved. 


So  we  know  more  or  less  what  C-obstacles  look  like.  The  free  space  is  the  Section  13.5* 

complement  of  the  union  of  these  C-obstacles.  Due  to  the  nasty  shape  of  the  motion  planning  with  rotations 

C-obstacles,  the  free  space  is  rather  complicated:  its  boundary  is  no  longer 

polygonal,  but  curved.  Moreover,  the  combinatorial  complexity  of  the  free 

space  can  be  quadratic  for  a  convex  robot,  and  even  cubic  for  a  non-convex 

robot.  Nevertheless,  we  can  solve  the  motion  planning  problem  using  the  same 

approach  we  took  before:  compute  a  decomposition  of  the  free  space  into  simple 

cells,  and  construct  a  road  map  to  guide  the  motions  between  neighboring  cells. 

Given  a  start  and  goal  placement  of  the  robot  we  then  find  a  path  as  follows. 

We  map  these  placements  to  points  in  configuration  space,  find  the  cells  that 
contain  the  points,  and  construct  a  path  consisting  of  three  parts:  a  path  from  the 
start  position  to  the  node  of  the  road  map  in  the  center  of  the  start  cell,  a  path 
along  the  road  map  to  the  node  in  the  center  of  the  goal  cell,  and  a  path  inside 
the  goal  cell  to  the  final  destination.  It  remains  to  map  the  path  in  configuration 
space  back  to  a  motion  in  the  work  space. 

Because  of  the  complex  shape  of  the  C-obstacles,  it  is  difficult  to  compute  a 
suitable  cell  decomposition,  especially  when  it  comes  to  an  actual  implemen¬ 
tation.  Therefore  we  shall  describe  a  different,  simpler  approach.  As  we  shall 
see,  however,  this  approach  has  its  drawbacks  as  well.  Our  approach  is  based 
on  the  same  observation  we  used  to  study  the  shape  of  the  C-obstacles,  namely 
that  the  motion  planning  problem  reduces  to  a  purely  translational  problem  if 
we  restrict  the  attention  to  a  horizontal  cross-section  of  the  configuration  space. 

We  will  call  such  a  cross-section  a  slice.  The  idea  is  to  compute  a  finite  number 
of  slices.  A  path  for  the  robot  now  consists  of  two  types  of  motion:  motions 
within  a  slice — these  are  purely  translational — and  motions  from  one  slice  to 
the  next  or  previous  one — these  will  be  purely  rotational. 


Let’s  formalize  this.  Let  z  denote  the  number  of  slices  we  take.  For  every  integer 
i  with  0<i<z— 1,  let  0,  =  i  x  (360/z).  We  compute  a  slice  of  the  free  space 
for  each  0,.  Since  within  the  slice  we  are  dealing  with  a  purely  translational 
problem  for  the  robot  fR(0,0, 0,),  we  can  compute  the  slice  using  the  methods  of 
the  previous  section.  This  will  give  us  the  trapezoidal  map  'T;  of  the  free  space 
within  the  slice.  For  each  T,  we  compute  a  road  map  S,.  These  road  maps  are 
used  to  plan  the  motions  within  a  slice,  as  in  Section  13.2. 

It  remains  to  connect  consecutive  slices.  More  precisely,  we  connect  every 
pair  of  roadmaps  5,,  S;+i  to  obtain  a  roadmap  Sroad  of  the  entire  configuration 
space.  This  is  done  as  follows.  We  take  the  trapezoidal  maps  of  each  pair  of 
consecutive  slices,  and  compute  their  overlay  with  the  algorithm  of  Chapter  2. 
(Strictly  speaking,  we  should  say  that  we  compute  the  overlay  of  the  projections 
of  T,  and  Ti+i  onto  the  plane  /;  :  0  =  0.)  This  tells  us  all  the  pairs  Ai,At 
with  Ai  £  T;-  and  A2  £  Tj+i  such  that  A]  intersects  A2.  Let  (x,y,  0)  be  a  point 
in  A]  HA2.  We  then  add  an  extra  node  to  Sroad  at  (x,;y,  0;)  and  at  (x,y,  0,-+i), 
which  we  connect  by  an  arc.  Moving  from  one  slice  to  the  other  along  this  arc 
corresponds  to  a  rotation  from  0,  to  0,+i,  or  back.  Furthermore,  the  node  at 
(x,y,  0,)  is  connected  to  the  node  at  the  center  of  A] ,  and  the  node  at  (x,y,  0,+i ) 
is  connected  to  the  node  at  the  center  of  A?.  These  connections  stay  within  a 
slice,  so  they  correspond  to  purely  translational  motions.  We  connect  S2-i  and 
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Chapter  13  So  in  the  same  way.  Note  that  paths  in  the  graph  Sroad  correspond  to  paths  of  the 

robot  motion  planning  robot  that  are  composed  of  purely  translational  motion  (when  we  move  along 

an  arc  connecting  nodes  in  the  same  slice)  and  purely  rotational  motion  (when 
we  move  along  an  arc  connecting  nodes  in  different  slices). 

Once  we  have  constructed  this  road  map,  we  can  use  it  to  plan  a  mo¬ 
tion  for  from  any  start  placement  3i(xstait,}'start,  0start)  to  any  goal  placement 
K(xg0ai;>’goai; 0 goai)-  To  do  this,  we  first  determine  the  slices  closest  to  the  start 
and  goal  placement  by  rounding  the  orientations  0start  and  0goa  1  to  the  nearest 
orientation  0,-  for  which  we  constructed  a  slice.  Within  those  slices  we  deter¬ 
mine  the  trapezoids  Astart  and  Agoa]  containing  the  start  and  goal  position.  If 
one  of  these  trapezoids  does  not  exist  because  the  start  or  goal  position  lies  in 
the  forbidden  space  within  the  slice,  then  we  report  that  we  cannot  compute 
a  path.  Otherwise,  let  vstart  and  vgoai  be  the  nodes  of  the  road  map  that  have 
been  placed  in  their  center.  We  try  to  find  a  path  in  Sroad  from  vstart  to  vgoai 
using  breadth-first  search.  If  there  is  no  path  in  the  graph,  we  report  that  we 
cannot  compute  a  motion.  Otherwise  we  report  a  motion  consisting  of  five 
parts:  a  purely  rotational  motion  from  the  start  position  to  the  nearest  slice,  a 
purely  translational  motion  within  that  slice  to  the  node  vstart,  a  motion  that 
corresponds  to  the  path  from  vstart  to  vgoai  in  Sroad,  a  purely  translational  motion 
from  vgoai  to  the  final  position  within  that  slice  (which  is  the  slice  nearest  to  the 
goal  position),  and  finally  a  purely  rotational  motion  to  the  real  goal  position. 


This  method  is  a  generalization  of  the  method  we  used  for  translating  motions, 
but  is  has  a  major  problem:  it  is  not  always  correct.  Sometimes  it  may  erro¬ 
neously  report  that  a  path  does  not  exist.  For  instance,  the  start  position  can  be 
in  the  free  space,  whereas  the  start  position  within  the  nearest  slice  is  not.  In 
this  case  we  report  that  there  is  no  path,  which  need  not  be  true.  Even  worse  is 
that  the  paths  we  report  need  not  be  collision-free.  The  translational  motions 
within  a  slice  are  okay,  because  we  solved  the  problem  within  a  slice  exactly, 
but  the  rotational  motions  from  one  slice  to  the  next  may  cause  problems:  the 
placements  within  the  two  slices  are  collision-free,  but  halfway  the  robot  could 
collide  with  an  obstacle.  Both  problems  are  less  likely  to  occur  when  we  in¬ 
crease  the  number  of  slices,  but  we  can  never  be  certain  of  the  correctness  of 
the  result.  This  is  especially  bothersome  for  the  second  problem:  we  definitely 
don’t  want  our  possibly  very  expensive  robot  to  have  a  collision. 


Figure  13.11 
Enlarging  the  robot 


Therefore  we  use  the  following  trick.  We  make  the  robot  slightly  larger, 
and  use  the  method  described  above  on  the  enlarged  robot  51' .  This  is  done  in 
such  a  way  that  although  51'  can  collide  during  rotations,  the  original  robot  3? 


cannot.  To  achieve  this,  the  robot  is  enlarged  as  follows.  Rotate  J!  clockwise  Section  13.6 

and  counterclockwise  over  an  angle  of  (180/z)°.  During  this  rotation  3?  sweeps  notes  and  comments 

a  part  of  the  plane.  We  use  for  the  enlarged  robot  JT  a  convex  polygon  that 

contains  the  sweep  area — see  Figure  13.11.  We  now  compute  the  trapezoidal 

maps  and  the  road  map  for  3T  instead  of  Jv.  It  is  not  difficult  to  prove  that  Jv 

cannot  collide  with  an  obstacle  during  a  purely  rotational  motion  between  two 

adjacent  slices,  even  though  3T  can.  By  enlarging  the  robot  we  have  introduced 

another  way  to  incorrectly  decide  that  there  is  no  path.  Again,  this  becomes  less 

likely  when  the  number  of  slices  increases.  So  with  a  large  number  of  slices, 

the  method  probably  performs  reasonably  well  in  practical  situations. 


13.6  Notes  and  Comments 

The  motion-planning  problem  has  received  a  lot  of  attention  over  the  years, 
both  from  people  working  in  computational  geometry  and  from  people  work¬ 
ing  in  robotics,  and  this  chapter  only  scratches  the  surface  of  all  the  research. 
A  much  more  extensive  treatment  of  the  problem  is  given  by  Latombe  [243]. 
Sharir  [352]  gives  a  survey  of  the  theoretical  results  that  have  been  obtained. 
Nevertheless,  the  concepts  we  have  introduced — configuration  space,  decompo¬ 
sition  of  the  free  space,  road  maps  that  transform  the  geometric  problem  into  a 
graph  searching  problem — underly  the  majority  of  approaches  that  have  been 
proposed. 

These  concepts  date  back  to  the  work  of  Lozano-Perez  [258,  259,  260]. 
An  important  difference  between  his  method  and  the  method  of  this  chapter  is 
that  he  used  an  approximate  decomposition  of  the  free  space.  The  approach  of 
Section  13.2,  which  uses  an  exact  decomposition  of  the  free  space  of  a  planar 
translating  robot  into  trapezoids,  is  based  on  more  recent  work  by  Kedem  et 
al.  [231,  232].  An  improved  algorithm,  which  runs  in  O(n\ogn)  time,  was  given 
by  Bhattacharya  and  Zorbas  [68], 

A  very  general  method  that  is  based  on  finding  an  exact  cell  decomposition 
of  the  free  space  was  given  by  Schwartz  and  Sharir  [341].  It  is  based  on  a 
decomposition  method  of  Collins  [136].  Unfortunately,  this  method  takes  time 
doubly  exponential  in  the  dimension  of  the  configuration  space.  This  can  be 
improved  using  a  decomposition  method  of  Chazelle  et  al.  [102]. 

In  this  chapter  we  have  seen  that  the  cell  decomposition  approach  leads 
to  an  0(n  log2  n)  algorithm  when  applied  to  a  convex  robot  translating  in  the 
plane.  The  bottleneck  in  the  algorithm  was  the  computation  of  the  union  of 
a  collection  of  Minkowski  sums.  Using  a  randomized  incremental  algorithm, 
instead  of  a  divide-and-conquer  algorithm,  this  step  can  be  done  in  0{n\ogn) 
time  [58,  280].  Our  approach  was  based  on  the  fact  that  the  Minkowski  sums  of 
convex  disjoint  polygons  with  some  fixed  other  convex  polygon  form  a  set  of 
pseudodiscs,  and  that  the  union  complexity  of  these  pseudodiscs  is  0(n ).  The 
latter  statement  is  even  true  for  non-polygonal  pseudodiscs:  there  are  at  most 
6n  —  12  breakpoints  (that  is,  boundary  intersections  of  two  pseudodiscs)  on  the 
boundary  of  the  union  of  n  pseudodiscs  [231].  Here  a  set  of  objects  is  defined 
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to  be  a  set  of  pseudodiscs  if  the  boundaries  of  any  pair  of  them  intersect  in  at 
most  two  points.  Extensions  of  this  result,  and  many  other  results  on  the  union 
complexity  of  certain  families  of  objects  are  discussed  in  a  recent  survey  by 
Agarwal  et  al.  [3], 

The  translational  motion  planning  problem  in  3-dimensional  space  can  be 
solved  in  0(»2log3»)  time  [22], 

The  approach  we  sketched  for  robots  that  can  translate  and  rotate  is  ap¬ 
proximate:  it  isn’t  guaranteed  to  find  a  path  if  it  exists.  It  is  possible  to  find 
an  exact  solution  by  computing  an  exact  cell  decomposition  of  the  free  space 
in  0(n3)  time  [33].  For  a  convex  robot,  the  running  time  can  be  reduced  to 
0(n2 log2 n)  [233]. 

The  free  space  of  a  robot  may  consist  of  a  number  of  disconnected  com¬ 
ponents.  Of  course,  the  motions  of  the  robot  are  confined  to  the  component 
where  it  starts;  to  go  to  another  component  it  would  have  to  pass  through  the 
forbidden  space.  Hence,  it  is  sufficient  to  compute  only  a  single  cell  in  the  free 
space,  instead  of  the  entire  free  space.  Usually  the  worst-case  complexity  of  a 
single  cell  is  one  order  of  magnitude  lower  than  the  complexity  of  the  entire  free 
space.  This  can  be  used  to  speed  up  the  asymptotic  running  time  of  the  motion 
planning  algorithms.  The  book  by  Agarwal  and  Sharir  [353]  and  the  thesis  by 
Halperin  [205]  discuss  single  cells  and  their  connection  to  motion  planning  at 
length. 

The  theoretical  complexity  of  the  motion  planning  is  exponential  in  the 
number  of  degrees  of  freedom  of  the  robot,  which  makes  the  problem  appear  in¬ 
tractable  for  high  DOF  robots.  Under  some  mild  restrictions  on  the  shape  of  the 
robot  and  the  obstacles — which  are  likely  to  be  satisfied  in  practical  situations — 
one  can  show  that  the  complexity  of  the  free  space  is  only  linear  [364,  365], 


Cell  decomposition  methods  are  not  the  only  exact  methods  for  motion  planning. 
Another  approach  is  the  so-called  retraction  method.  Here  a  road  map  is 
constructed  directly,  without  decomposing  the  free  space.  Furthermore,  a 
retraction  function  is  defined,  which  maps  any  point  in  the  free  space  to  a  point 
on  the  road  map.  Once  this  has  been  done,  paths  can  be  found  by  retracting 
both  start  and  goal  to  the  road  map  and  next  following  a  path  along  the  road 
map.  Different  types  of  road  maps  and  retraction  functions  have  been  proposed. 
A  nice  road  map  is  the  Voronoi  diagram,  because  it  stays  as  far  away  from  the 
obstacles  as  possible.  If  the  robot  is  a  disc,  then  we  can  use  the  normal  Voronoi 
diagram — this  was  already  discussed  in  Section  7.3.  Otherwise  one  has  to  use 
a  different  distance  function  in  the  definition  of  the  Voronoi  diagram,  which 
depends  on  the  shape  of  the  robot.  Still,  such  a  diagram  can  often  be  computed 
in  O(n\ogn)  time  [255,  296],  leading  to  another  0(n  log/;]  time  algorithm  for 
translational  motion  planning.  A  very  general  road  map  method  has  been 
proposed  by  Canny  [80].  It  can  solve  almost  any  motion  planning  problem  in 
time  0(nd  log/;),  where  d  is  the  dimension  of  the  configuration  space,  that  is, 
the  number  of  degrees  of  freedom  of  the  robot.  Unfortunately,  the  method  is 
very  complicated,  and  it  has  the  disadvantage  that  most  of  the  time  the  robot 
moves  in  contact  with  an  obstacle.  This  is  often  not  the  preferred  type  of  motion. 


304 


In  this  chapter  we  have  concentrated  mainly  on  exact  motion  planning.  There 
are  also  a  number  of  heuristic  approaches. 

For  instance,  one  can  use  approximate  cell  decompositions  [74,  258,  259, 
398]  instead  of  exact  ones.  These  are  often  based  on  quadtrees. 

Another  heuristic  is  the  potential  field  method  [39,  235,  378].  Here  one 
defines  a  potential  field  on  the  configuration  space  by  making  the  goal  position 
attract  the  robot  and  making  the  obstacles  repel  it.  The  robot  then  moves  in  the 
direction  dictated  by  the  potential  field.  The  problem  with  this  approach  is  that 
the  robot  can  get  stuck  in  local  minima  of  the  potential  field.  Various  techniques 
have  been  proposed  to  escape  the  minima. 

Another  heuristic  which  has  recently  become  popular  is  the  probabilistic 
road  map  method  [230,  310].  It  computes  a  number  of  random  placements  for 
the  robot,  which  are  connected  in  some  way  to  form  a  road  map  of  the  free 
space.  The  road  map  can  then  be  used  to  plan  paths  between  any  given  start  and 
goal  placement. 

Minkowski  sums  play  an  important  role  not  only  in  motion  planning,  but  also 
in  other  problems.  An  example  is  the  problem  of  placing  one  polygon  inside 
another  [119];  this  can  be  useful  if  one  wants  to  cut  out  some  shape  from  a  piece 
of  fabric.  For  some  basic  results  on  properties  of  Minkowski  sums  and  their 
computation  we  refer  to  [43,  197]. 

In  this  chapter  we  concentrated  on  finding  some  path  for  the  robot,  but  we  didn’t 
try  to  find  a  short  path.  This  is  the  topic  of  Chapter  15. 

Finally,  we  note  that  we  allowed  paths  where  the  robot  touches  an  obstacle. 
Such  paths  are  sometimes  called  “semi-free”  [243,  340].  Paths  that  do  not  touch 
any  obstacle  are  then  called  “free”.  It  is  useful  to  be  aware  of  these  terms  when 
studying  the  motion  planning  literature. 


13.7  Exercises 

13.1  Let  X  be  a  robotic  arm  with  a  fixed  base  and  seven  links.  The  last  joint 
of  3?  is  a  prismatic  joint,  the  other  ones  are  revolute  joints.  Give  a  set  of 
parameters  that  determines  a  placement  of  3L  What  is  the  dimension  of 
the  configuration  space  resulting  from  your  choice  of  parameters? 

13.2  In  the  road  map  Sroad  that  was  constructed  on  the  trapezoidal  decomposi¬ 
tion  of  the  free  space  we  added  a  node  in  the  center  of  each  trapezoid  and 
on  each  vertical  wall.  It  is  possible  to  avoid  the  nodes  in  the  center  of 
each  trapezoid.  Show  how  the  graph  can  be  changed  such  that  only  nodes 
on  the  vertical  walls  are  required.  (Avoid  an  increase  in  the  number  of 
edges  in  the  graph.)  Explain  how  to  adapt  the  query  algorithm. 

13.3  Prove  that  the  shape  of  G IP;  is  independent  of  the  choice  of  the  reference 
point  in  the  robot  3L 
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13.4  Draw  the  Minkowski  sum  ©i  ©  ©2  for  the  case  where 

a.  both  ©1  and  ©2  are  unit  discs; 

b.  both  ©1  and  ©2  are  unit  squares; 

c.  ©1  is  a  unit  disc  and  ©2  is  a  unit  square; 

d.  ©1  is  a  unit  square  and  ©2  is  a  triangle  with  vertices  (0,0),  (1,0),  (0, 1). 


13.5  Let  ©1  and  ©2  be  two  convex  polygons.  Let  Si  be  the  collection  of 
vertices  of  ©1  and  S2  be  the  collection  of  vertices  of  ©2.  Prove  that 


©1  ©  ©2  =  ConvexHull{S\  ©S2). 


13.6  Prove  Observation  13.4. 

13.7  In  Theorem  13.9  we  gave  an  0{n)  bound  on  the  complexity  of  the 
union  of  a  set  of  polygonal  pseudodiscs  with  n  vertices  in  total.  We  are 
interested  in  the  precise  bound. 

a.  Assume  that  the  union  boundary  contains  m  original  vertices  of  the 
polygons.  Show  that  the  complexity  of  the  union  boundary  is  at  most 
2 n  —  m.  Use  this  to  prove  an  upper  bound  of  2 n  —  3  on  the  complexity 
of  the  union  boundary. 

b.  Prove  a  lower  bound  of  2 n  —  6  by  constructing  an  example  that  has 
this  complexity. 
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14  Quadtrees 


Non-Uniform  Mesh  Generation 


Almost  all  electrical  devices,  from  shavers  and  telephones  to  televisions  and 
computers,  contain  some  electronic  circuitry  to  control  their  functioning.  This 
circuitry — VLSI  circuits,  resistors,  capacitors,  and  other  electric  components — 
is  placed  on  a  printed  circuit  board.  To  design  printed  circuit  boards  one  has  to 
decide  where  to  place  the  components,  and  how  to  connect  them.  This  raises  a 
number  of  interesting  geometric  problems,  of  which  this  chapter  tackles  one: 
mesh  generation. 

Many  components  on  a  printed  circuit  board  emit  heat  during  operation.  In  order 
for  the  board  to  function  properly,  the  emission  of  heat  should  be  below  a  certain 
threshold.  It  is  difficult  to  predict  in  advance  whether  the  heat  emission  will 
cause  problems,  since  this  depends  on  the  relative  positions  of  the  components 
and  the  connections.  In  former  days  one  therefore  made  a  prototype  of  the  board 
to  determine  the  heat  emission  experimentally.  If  it  turned  out  to  be  too  high, 
an  alternative  layout  had  to  be  designed.  Today  the  experiments  can  often  be 
simulated.  Because  the  design  is  largely  automated,  a  computer  model  of  the 
board  is  readily  available  and  simulation  is  a  lot  faster  than  building  a  prototype. 
Simulation  also  allows  testing  during  the  initial  stages  of  the  design  phase,  so 
that  faulty  designs  can  be  rejected  as  early  as  possible. 

The  heat  transfer  between  different  materials  on  the  printed  circuit  board  is 
a  quite  complicated  process.  To  simulate  the  heat  processes  on  the  board  one 
therefore  has  to  resort  to  approximation  using  finite  element  methods.  Such 
methods  first  divide  the  board  into  many  small  regions,  or  elements.  The 
elements  are  usually  triangles  or  quadrilaterals.  The  heat  that  each  element 
emits  by  itself  is  assumed  to  be  known.  It  is  also  assumed  to  be  known  how 
neighboring  elements  influence  each  other.  This  leads  to  a  big  system  of 
equations,  which  is  then  solved  numerically. 

The  accuracy  of  finite  element  methods  depends  heavily  on  the  mesh:  the 
finer  the  mesh,  the  better  the  solution.  The  other  side  of  the  coin  is  that  the 
computation  time  for  the  numerical  process  increases  drastically  when  the 
number  of  elements  increases.  So  we  would  like  to  use  a  fine  mesh  only  where 
necessary.  Often  this  is  at  the  border  between  regions  of  different  material.  It 
is  also  important  that  the  mesh  elements  respect  the  borders,  that  is,  that  any 
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Chapter  14  mesh  element  is  contained  in  only  one  region.  Finally,  the  shape  of  the  mesh 

quadtrees  elements  plays  an  important  role:  irregularly  shaped  elements  such  as  very  thin 

triangles  often  lead  to  a  slower  convergence  of  the  numerical  process. 


14.1  Uniform  and  Non-Uniform  Meshes 

We'll  study  the  following  variant  of  the  mesh  generation  problem.  The  input 
is  a  square — the  printed  circuit  board — with  a  number  of  disjoint  polygonal 
components  inside  it.  The  square  together  with  the  components  is  sometimes 
called  the  domain  of  the  mesh.  The  vertices  of  the  square  are  at  (0,0),  (0,f/), 
(f/,0),  ( U .  U ) ,  where  U  =  V  for  a  positive  integer  j.  The  coordinates  of  the 
vertices  of  the  components  are  assumed  to  be  integers  between  0  and  U.  We 
make  one  more  assumption,  which  is  satisfied  in  a  number  of  applications:  the 
edges  of  the  components  have  only  four  different  orientations.  In  particular,  the 
angle  that  an  edge  makes  with  the  x-axis  is  either  0°,  45°,  90°,  or  135°. 

Our  goal  is  to  compute  a  triangular  mesh  of  the  square,  that  is,  a  subdivi¬ 
sion  of  the  square  into  triangles.  We  require  the  mesh  to  have  the  following 
properties: 

■  The  mesh  must  be  conforming:  a  triangle  is  not  allowed  to  have  a  vertex  of 
another  triangle  in  the  interior  of  one  of  its  edges. 

■  The  mesh  must  respect  the  input:  the  edges  of  the  components  must  be 
contained  in  the  union  of  the  edges  of  the  mesh  triangles. 

■  The  mesh  triangles  must  be  well-shaped:  the  angles  of  any  mesh  triangle 
should  not  be  too  large  nor  too  small.  In  particular,  we  require  them  to  be  in 
the  range  from  45°  to  90°. 

Finally,  we  would  like  the  mesh  to  be  fine  only  where  necessary.  Where  this  is 
depends  on  the  application.  The  property  we  shall  require  is  as  follows. 

■  The  mesh  must  be  non-uniform:  it  should  be  fine  near  the  edges  of  the 
components  and  coarse  far  away  from  the  edges. 
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We  have  already  seen  triangulations  earlier  in  this  book.  Chapter  3  presented 
an  algorithm  for  triangulating  a  simple  polygon,  and  Chapter  9  presented  an 
algorithm  for  triangulating  a  point  set.  The  latter  algorithm  computes  the 
Delaunay  triangulation,  a  triangulation  that  maximizes  the  minimum  angle  over 
all  possible  triangulations.  Given  our  restriction  on  the  angles  of  the  mesh 
triangles  this  seems  quite  useful,  but  there  are  two  problems. 

First  of  all,  a  triangulation  of  the  vertices  of  the  components  need  not  respect 
the  edges  of  the  components.  Even  if  it  did,  there  is  a  second  problem:  there 
can  still  be  angles  that  are  too  small.  Suppose  that  the  input  is  a  square  with 
side  length  16  and  one  component,  which  is  a  small  square  with  side  length  1 
placed  in  the  top  left  corner  at  distance  1  from  the  left  and  the  top  side  of  the 
square.  Then  the  Delaunay  triangulation  contains  triangles  that  have  an  angle  of 
less  than  5°.  Since  the  Delaunay  triangulation  maximizes  the  minimum  angle,  it 
seems  impossible  to  generate  a  mesh  with  only  well-shaped  triangles.  But  there 
is  a  catch:  unlike  in  a  tri angulation,  the  triangles  in  a  mesh  are  not  required  to 
have  their  vertices  at  the  input  points.  We  are  allowed  to  add  extra  points,  called 
Steiner  points,  to  help  us  obtain  well-shaped  triangles.  A  triangulation  that 
uses  Steiner  points  is  sometimes  called  a  Steiner  triangulation.  In  our  example, 
if  we  add  a  Steiner  point  at  every  grid  point  inside  the  square,  then  we  can 
easily  obtain  a  mesh  consisting  only  of  triangles  with  two  45°  angles  and  one 
90°  angle — see  the  mesh  shown  on  the  left  in  Figure  14.2.  Unfortunately,  this 
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mesh  suffers  from  another  problem:  it  uses  small  triangles  everywhere,  not  only 
near  the  edges  of  the  input,  so  it  is  a  uniform  mesh.  As  a  result,  it  has  many 
triangles.  We  cannot  simply  replace  all  the  triangles  in,  say,  the  bottom  right 
quarter  of  the  square  by  two  big  triangles,  because  then  the  mesh  would  no 
longer  be  conforming.  Nevertheless,  if  we  gradually  increase  the  size  of  the 
triangles  when  we  get  farther  away  from  top  left  corner,  then  it  is  possible  to 
get  a  conforming  mesh  with  only  well-shaped  triangles,  as  shown  on  the  right 
in  Figure  14.2.  This  leads  to  a  significantly  smaller  number  of  triangles:  the 
uniform  mesh  has  512  triangles,  whereas  the  non-uniform  one  only  has  52. 


14.2  Quadtrees  for  Point  Sets 


Section  14.2 

QUADTREES  FOR  POINT  SETS 


Figure  14.2 

A  uniform  and  a  non-uniform  mesh 


The  non-uniform  mesh  generation  method  we  shall  describe  in  the  next  section 
is  based  on  quadtrees.  A  quadtree  is  a  rooted  tree  in  which  every  internal  node 
has  four  children.  Every  node  in  the  quadtree  corresponds  to  a  square.  If  a 
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node  v  has  children,  then  their  corresponding  squares  are  the  four  quadrants 
of  the  square  of  v — hence  the  name  of  the  tree.  This  implies  that  the  squares 
of  the  leaves  together  form  a  subdivision  of  the  square  of  the  root.  We  call 
this  subdivision  the  quadtree  subdivision.  Figure  14.3  gives  an  example  of 
a  quadtree  and  the  corresponding  subdivision.  The  children  of  the  root  are 


Figure  14.3 
A  quadtree  and  the  corresponding 
subdivision 


labelled  NE,  NW,  SW,  and  SE  to  indicate  to  which  quadrant  they  correspond; 
NE  stands  for  the  north-east  quadrant,  NW  for  the  north-west  quadrant,  and  so 
on. 

Before  we  continue,  we  introduce  some  terminology  related  to  quadtree 
subdivisions.  The  faces  in  a  quadtree  subdivision  have  the  shape  of  a  square. 
Although  they  can  have  more  than  four  vertices,  we  shall  call  them  squares 
anyway.  The  four  vertices  at  the  corners  of  the  square  are  called  corner  vertices, 
or  corners  for  short.  The  line  segments  connecting  consecutive  corners  are  the 
sides  of  the  square.  The  edges  of  the  quadtree  subdivision  that  are  contained 
in  the  boundary  of  a  square  are  called  the  edges  of  the  square.  Elence,  a  side 
contains  at  least  one,  but  possibly  many  more,  edges.  We  say  that  two  squares 
are  neighbors  if  they  share  an  edge. 

Quadtrees  can  be  used  to  store  different  types  of  data.  We  will  describe  the 
variant  that  stores  a  set  of  points  in  the  plane.  In  this  case  the  recursive  splitting 
of  squares  continues  as  long  as  there  is  more  than  one  point  in  a  square.  So  the 
definition  of  a  quadtree  for  a  set  P  of  points  inside  a  square  a  is  as  follows.  Let 

C7  :=  \xa  :  4]  x  bff  :  y'o\- 

m  If  card  if)  ^  1  then  the  quadtree  consists  of  a  single  leaf  where  the  set  P 
and  the  square  o  are  stored. 

■  Otherwise,  let  One,  Onw,  Osw,  and  o$e  denote  the  four  quadrants  of  o.  Let 
An  id  :=  (At+4)/2  and  ymid  :=  {ya  +/J/2,  and  define 


Pn  e 

■=  {p  e  p 

Px  >  xmid 

and 

Py>y  mid}, 

Pn  w 

■=  {p  e  p 

Px  ^  -Xrnid 

and 

Py>ymid  }, 

Psw 

:=  {pGP 

Px  ^  -*rnid 

and 

Py  <  Amid}, 

PSE 

:=  {pGP 

Px  >  xmid 

and 

Py  f  I'm  id}- 

The  quadtree  now  consists  of  a  root  node  v  where  the  square  o  is  stored. 

_  Below  we  shall  denote  the  square  stored  at  v  by  o(v).  Furthermore,  v  has 

310  four  children: 
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QUADTREES 


■  the  NE-child  is  the  root  of  a  quadtree  for  the  set  /\'i:  inside  the  square  One.  Section  14.2 

■  the  NW-child  is  the  root  of  a  quadtree  for  the  set  /kiw  inside  the  quadtrees  for  point  sets 

square  Onw. 

■  the  SW-child  is  the  root  of  a  quadtree  for  the  set  Psw  inside  the  square  o$w. 

■  the  SE-child  is  the  root  of  a  quadtree  for  the  set  Pse  inside  the  square  Ose- 


The  choice  of  using  less-than-or-equal-to  and  greater-than  in  the  definition  of 
the  sets  lf\:.  /’nw,  Psw.  and  Pse  means  that  we  define  the  vertical  splitting  line 
to  belong  to  the  left  two  quadrants,  and  the  horizontal  splitting  line  to  the  lower 
two  quadrants. 

Every  node  v  of  the  quadtree  stores  its  corresponding  square  cr(v).  This  is 
not  necessary;  we  could  store  only  the  square  of  the  root  of  the  tree.  When  we 
walk  down  the  tree  we  would  then  have  to  maintain  the  square  of  the  current 
node.  This  alternative  uses  less  storage  at  the  expense  of  extra  computations 
that  have  to  be  done  when  the  quadtree  is  queried. 
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The  recursive  definition  of  a  quadtree  immediately  translates  into  a  recursive 
algorithm:  split  the  current  square  into  four  quadrants,  partition  the  point  set 
accordingly,  and  recursively  construct  quadtrees  for  each  quadrant  with  its 
associated  point  set.  The  recursion  stops  when  the  point  set  contains  less  than 
two  points.  The  only  detail  that  does  not  follow  from  the  recursive  definition 
is  how  to  find  the  square  with  which  to  start  the  construction.  Sometimes  this 
square  will  be  given  as  a  part  of  the  input.  If  this  is  not  the  case,  then  we 
compute  a  smallest  enclosing  square  for  the  set  of  points.  This  can  be  done  in 
linear  time  by  computing  the  extreme  points  in  the  x-  and  y-directions. 

At  every  step  of  the  quadtree  construction  the  square  containing  the  points 
is  split  into  four  smaller  squares.  This  does  not  mean  that  the  point  set  is  split 
as  well:  it  can  happen  that  all  the  points  lie  in  the  same  quadrant.  Therefore  a 
quadtree  can  be  quite  unbalanced,  and  it  is  not  possible  to  express  the  size  and 
depth  of  a  quadtree  as  a  function  of  the  number  of  points  it  stores.  However,  the 
depth  of  a  quadtree  is  related  to  the  distance  between  the  points  and  the  size  of 
the  initial  square.  This  is  made  precise  in  the  following  lemma. 

Lemma  14.1  The  depth  of  a  quadtree  for  a  set  P  of  points  in  the  plane  is  at  most 
log (s/c)  +  where  c  is  the  smallest  distance  between  any  two  points  in  P  and 
s  is  the  side  length  of  the  initial  square  that  contains  P. 

Proof.  When  we  descend  from  a  node  to  one  of  its  children,  the  size  of  the 
corresponding  square  halves.  Hence,  the  side  length  of  the  square  of  a  node  at 
depth  i  is  s/2' .  The  maximum  distance  between  two  points  inside  a  square  is 
given  by  the  length  of  its  diagonal,  which  is  sV 2/2'  for  the  square  of  a  node 
at  depth  i.  Since  an  internal  node  of  a  quadtree  has  at  least  two  points  in  its 
associated  square,  and  the  minimum  distance  between  two  points  is  c,  the  depth 
i  of  an  internal  node  must  satisfy 

sV 2/2'  ^  c, 


•V 

• 

i  <  log 


s\Jl 


which  implies 


c 


logO/c)  +  -*. 
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Chapter  14  The  lemma  now  follows  from  the  fact  that  the  depth  of  a  quadtree  is  exactly  one 
quadtrees  more  than  the  maximum  depth  of  any  internal  node.  Li 


Figure  14.4 
A  node  at  depth  i  corresponds  to  a 
square  of  side  length  i/2' 


The  size  of  a  quadtree  and  the  construction  time  are  a  function  of  the  depth 
of  the  quadtree  and  the  number  of  points  in  P. 

Theorem  14.2  A  quadtree  of  depth  d  storing  a  set  of  n  points  has  0((d  +  1  )n) 
nodes  and  can  be  constructed  in  0((d  +  1  )n)  time. 

Proof.  Every  internal  node  in  a  quadtree  has  four  children,  so  the  total  number 
of  leaves  is  one  plus  three  times  the  number  of  internal  nodes.  Hence,  it  suffices 
to  bound  the  number  of  internal  nodes. 

Any  internal  node  has  one  or  more  points  inside  its  associated  square. 
Moreover,  the  squares  of  the  nodes  at  the  same  depth  in  the  quadtree  are  disjoint 
and  exactly  cover  the  initial  square.  This  means  that  the  total  number  of  internal 
nodes  at  any  given  depth  is  at  most  n.  The  bound  on  the  size  of  the  quadtree 
follows. 

The  most  time-consuming  task  in  one  step  of  the  recursive  construction 
algorithm  is  the  distribution  of  points  over  the  quadrants  of  the  current  square. 
Hence,  the  amount  of  time  spent  at  an  internal  node  is  linear  in  the  number  of 
points  that  lie  in  the  associated  square.  We  argued  above  that  the  total  number 
of  points  associated  with  nodes  at  the  same  depth  in  the  tree  is  at  most  n,  from 
which  the  time  bound  follows.  0 


An  operation  on  quadtrees  that  is  often  needed  is  neighbor  finding-,  given  a 
node  v  and  a  direction — north,  east,  south,  or  west — find  a  node  v'  such  that 
<j(v')  is  adjacent  to  cr(v)  in  the  given  direction.  Usually  the  given  node  is  a 
leaf,  and  one  wants  the  reported  node  to  be  a  leaf  as  well.  This  corresponds  to 
finding  an  adjacent  square  of  a  given  square  in  the  quadtree  subdivision.  The 
algorithm  we  shall  describe  is  a  little  bit  different:  the  given  node  v  can  also  be 
internal,  and  the  algorithm  tries  to  find  the  node  V1  such  that  <7(V)  is  adjacent 
to  cr(v)  in  the  given  direction  and  v'  and  v  are  at  the  same  depth.  If  there  is  no 
such  node,  then  it  finds  the  deepest  node  whose  square  is  adjacent.  If  there  is 
no  adjacent  square  in  the  given  direction — this  can  happen  if  <t(v)  has  an  edge 
contained  in  an  edge  of  the  initial  square — then  the  algorithm  reports  nil. 

The  neighbor-finding  algorithm  works  as  follows.  Suppose  that  we  want 
to  find  the  north-neighbor  of  v.  If  v  happens  to  be  the  SE-  or  SW-child  of  its 
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parent,  then  its  north-neighbor  is  easy  to  find:  it  is  the  NE-  or  NW-child  of  the 
parent,  respectively.  If  v  itself  is  the  NE-  or  NW-child  of  its  parent,  then  we 
proceed  as  follows.  We  recursively  find  the  north-neighbor,  p,  of  the  parent  of 
V.  If  /i  is  an  internal  node,  then  the  north-neighbor  of  v  is  a  child  of  /i;  if  p 
is  a  leaf,  then  the  north-neighbor  we  seek  is  p  itself.  The  pseudocode  for  this 
algorithm  is  as  follows. 


Section  14.2 

QUADTREES  FOR  POINT  SETS 


north-neighbor  of  parent(v) 


Algorithm  NorthNeighbor(v,T) 

Input.  A  node  v  in  a  quadtree  X 

Output.  The  deepest  node  v'  whose  depth  is  at  most  the  depth  of  v  such  that 
<7(V)  is  a  north-neighbor  of  cr(v),  and  nil  if  there  is  no  such  node. 

1.  if  v  =  root (X)  then  return  nil 

2.  if  v  =  SW-child  of  parent(v)  then  return  NW-child  of  parent(v) 

3.  if  V  =  SE-child  of  parent(v)  then  return  NE-child  of  parent(v) 

4.  p  <— NORTHNEIGHBOR(parenf(v),T) 

5.  if  p  =  nil  or  p  is  a  leaf 

6.  then  return  p 

7.  else  if  v  =  NW-child  of  parent(v) 

8.  then  return  SW-child  of  p 

9.  else  return  SE-child  of  p 


a(v) 

This  algorithm  does  not  necessarily  report  a  leaf  node.  If  we  insist  on  finding  a 
leaf  node,  then  we  have  to  walk  down  the  quadtree  from  the  node  found  by  our 
algorithm,  always  proceeding  to  a  south-child. 

The  algorithm  spends  0(1)  time  at  every  recursive  call.  Moreover,  at  every 
call  the  depth  of  the  argument  node  v  decreases  by  one.  Hence,  the  running 
time  is  linear  in  the  depth  of  the  quadtree.  We  get  the  following  theorem: 


Theorem  14.3  Let  T  be  a  quadtree  of  depth  d.  The  neighbor  of  a  given  node  v 
in  X  in  a  given  direction,  as  defined  above,  can  be  found  in  0(d  +1)  time. 

We  already  observed  that  a  quadtree  can  be  quite  unbalanced.  As  a  result, 
large  squares  can  be  adjacent  to  many  small  squares.  In  some  applications — in 
particular  in  the  application  to  meshing — this  is  unwanted.  Therefore  we  now 
discuss  a  variant  of  quadtrees,  the  balanced  quadtree,  that  does  not  have  this 
problem. 

A  quadtree  subdivision  is  called  balanced  if  any  two  neighboring  squares 
differ  at  most  a  factor  two  in  size.  A  quadtree  is  called  balanced  if  its  subdivision 
is  balanced.  So  in  a  balanced  quadtree  any  two  leaves  whose  squares  are 
neighbors  can  differ  at  most  one  in  depth.  Figure  14.5  shows  an  example  of  a 
quadtree  subdivision  that  has  been  made  balanced.  The  original  subdivision  is 
shown  solid,  and  its  refinement  is  dotted. 

A  quadtree  can  be  made  balanced  with  the  following  algorithm: 


Algorithm  BalanceQuadTree(T) 

Input.  A  quadtree  X 

Output.  A  balanced  version  of  X 

1 .  Insert  all  the  leaves  of  T  into  a  linear  list  £. 

2.  while  £  is  not  empty 
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Figure  14.5 

A  quadtree  and  its  balanced  version 


3.  do  Remove  a  leaf  /i  from  L. 

4.  if  cr(ju)  has  to  be  split 

5.  then  Make  /i  into  an  internal  node  with  four  children,  which  are 

leaves  that  correspond  to  the  four  quadrants  of  cr(/i).  If  ii 
stores  a  point,  then  store  the  point  in  the  correct  new  leaf 
instead. 

6.  Insert  the  four  new  leaves  into  L. 

7.  Check  if  (j(ju)  had  neighbors  that  now  need  to  be  split  and. 
if  so,  insert  them  into  L. 

8.  return  T 

There  are  two  steps  in  the  algorithm  that  require  some  explanation. 

First,  we  have  to  check  whether  a  given  square  needs  to  be  split.  This 
means  that  we  have  to  check  whether  is  adjacent  to  a  square  of  less  than 
half  its  size.  This  can  be  done  using  the  neighbor-finding  algorithm  described 
earlier,  as  follows.  Suppose  that  we  are  looking  for  a  north-neighbor  of  cr(/j. ) 
that  is  less  than  half  the  size  of  There  is  such  a  square  if  and  only  if 

NorthNeighbor(/t,T)  reports  a  node  that  has  a  SW-child  or  a  SE-child  that 
is  not  a  leaf. 

Second,  we  have  to  check  if  cr(jU )  had  neighbors  that  now  need  to  be  split. 
Again,  this  can  be  done  using  the  neighbor-finding  algorithm:  for  example, 
cr(jU)  has  such  a  neighbor  to  the  north  if  and  only  if  North  NHiGimoRi/i.'T) 
reports  a  node  whose  square  is  larger  than  ct(/j,). 

So  now  we  have  an  algorithm  to  make  a  quadtree  balanced.  But  before  we 
can  analyze  the  running  time  of  the  balancing  algorithm  we  must  answer  the 
following  question:  what  happens  to  the  size  of  the  quadtree  when  we  make 
it  balanced?  From  Figure  14.5  we  may  get  the  idea  that  the  complexity  of  a 
balanced  quadtree  subdivision  can  be  quite  a  lot  higher  than  that  of  its  unbal¬ 
anced  version.  First  of  all,  large  squares  adjacent  to  very  small  ones  get  split  up 
many  times.  Secondly,  the  splitting  may  propagate:  sometimes  it  seems  that  a 
square  cr  need  not  be  split  because  its  neighbors  initially  have  the  right  size,  but 
these  neighbors  may  have  to  be  split  so  that  cr  must  be  split  after  all.  The  next 
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theorem  shows  that  things  are  not  as  bad  as  appears  at  first  sight,  and  that  the 
balancing  can  be  done  efficiently. 

Theorem  14.4  Let  7  be  a  quadtree  with  m  nodes.  Then  the  balanced  version  of 
7  has  0(m)  nodes  and  it  can  be  constructed  in  0((d  +  1  )m)  time. 

Proof.  We  first  prove  the  bound  on  the  number  of  nodes.  Denote  the  balanced 
version  of  7  by  Tg.  The  tree  Tg  is  obtained  from  T  by  a  number  of  splitting 
operations,  which  replace  a  leaf  by  one  internal  node  with  four  leaves.  We  shall 
prove  that  only  8 m  splitting  operations  are  performed.  Since  a  single  splitting 
operation  increases  the  total  number  of  (internal  and  leaf)  nodes  by  four,  this 
proves  that  the  number  of  nodes  in  T#  is  0(m),  as  claimed. 

Call  the  squares  of  nodes  in  7  old  squares,  and  the  squares  of  nodes  that 
are  in  7 g  but  not  in  7  new  squares.  Suppose  that  we  have  to  split  an — old  or 
new — square  a  in  the  balancing  process.  (The  quadrants  of  a  may  have  to  be 
split  further,  but  this  will  be  accounted  for  separately.  Here  we  only  need  to 
account  for  the  increase  in  the  number  of  nodes  by  four  due  to  the  splitting 
of  a.)  Below  we  shall  prove  that  at  least  one  of  the  eight  equal-sized  squares 
surrounding  a  must  be  an  old  square.  We  charge  the  splitting  of  a  to  one 
of  these  old  square.  Every  old  square — equivalently,  every  node  of  T — gets 
charged  at  most  eight  splittings  in  this  manner,  so  the  total  number  of  splits  is  at 
most  8 m,  as  claimed. 

We  now  prove  that  for  any  square  that  is  split  in  the  balancing  process,  at 
least  one  of  the  eight  equal-sized  squares  surrounding  it  must  be  an  old  square. 
Suppose  there  would  be  squares  for  which  this  claim  does  not  hold.  Let  a  be 
the  smallest  such  square.  Since  a  is  split,  it  must  have  been  adjacent  to  a  small 
square,  one  of  less  than  half  its  size.  Let  a'  be  the  square  that  has  exactly  half 
the  size  of  a  and  that  contains  this  small  square.  Since  a'  is  contained  in  a 
new  square,  it  is  also  new  itself.  Hence,  it  must  have  been  split  in  the  balancing 
process.  Now  we  observe  that  all  eight  equal-sized  squares  surrounding  o'  must 
be  new,  because  they  are  either  contained  in  one  the  squares  surrounding  a 
(which  are  new)  or  in  a  (which  is  assumed  to  be  split  in  the  balancing  process). 
Hence,  o' ,  which  is  smaller  than  <7,  is  a  square  that  is  split  and  none  of  whose 
eight  surrounding  squares  is  old.  This  contradicts  the  definition  of  a ,  thus 
finishing  the  proof  of  the  first  part  of  the  theorem. 

What  remains  is  to  prove  that  BalanceQuadTree  takes  0((d  +  1  )m)  time. 
The  time  needed  to  handle  a  node  /i  is  0(d  +  1),  because  a  constant  number  of 
neighbor-finding  operations  is  performed.  Since  any  node  is  handled  at  most 
once  and  the  total  number  of  nodes  is  0(m ),  the  total  time  is  0((d  +  1  )m).  ED 


14.3  From  Quadtrees  to  Meshes 
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FROM  QUADTREES  TO  MESHES 


(7  . . J. 

-  a' 


We  now  return  to  the  mesh  generation  problem.  Recall  that  the  input  is  a  square 
[0  :  U]  x  [0  :  U],  where  U  =  2-'  for  some  positive  integer  with  a  number  of 
disjoint  polygonal  components  inside  it.  The  polygon  vertices  have  integer 
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coordinates,  and  the  polygon  edges  have  one  of  four  possible  orientations:  the 
angle  that  any  edge  makes  with  the  positive  x-axis  is  0°,  45°,  90°,  or  135°.  The 
goal  is  to  compute  a  triangular  mesh  of  the  square  (both  outside  and  inside  the 
components)  that  is  conforming,  respects  the  input,  has  well-shaped  triangles, 
and  is  non-uniform. 

The  idea  is  to  use  a  quadtree  subdivision  as  the  first  step  towards  a  mesh. 
When  we  construct  a  quadtree  on  a  point  set,  then  we  stop  the  recursive  con¬ 
struction  when  a  square  contains  less  than  two  points.  Since  we  are  now  dealing 
with  polygonal  input,  we  have  to  reformulate  the  stopping  criterion.  Because 
we  want  the  mesh  triangles  to  be  fine  near  the  edges  of  the  components,  we 
keep  on  splitting  as  long  as  a  square  is  intersected  by  an  edge.  More  precisely, 
the  new  stopping  criterion  is  this:  we  stop  splitting  when  the  square  is  no  longer 
intersected  by  any  edge  of  any  component,  or  when  it  has  unit  size.  We  consider 
the  square  and  the  edges  to  be  closed,  so  that  for  instance  a  square  that  has  an 
edge  contained  in  one  of  its  sides  is  intersected  by  that  edge.  This  stopping 
criterion  guarantees  that  the  quadtree  subdivision  will  be  non-uniform:  the  edges 
of  the  components  will  be  surrounded  by  unit  size  squares,  and  the  squares  will 
be  bigger  farther  away  from  the  edges. 

We  claim  that  the  interior  of  any  square  in  the  resulting  quadtree  subdivision 
can  be  intersected  by  an  edge  of  a  component  in  only  one  way:  the  intersection 
must  be  a  diagonal  of  the  square.  Indeed,  squares  whose  closure  is  intersected 
have  unit  size,  and  the  vertices  of  the  components  have  integers  coordinates. 
Hence,  the  interior  of  a  square  cannot  be  intersected  by  a  horizontal  or  vertical 
edge,  and  the  intersection  with  an  edge  of  orientation  45°  or  135°  must  be  a 
diagonal.  It  seems  that  we  only  have  to  add  diagonals  to  the  squares  whose 
interior  is  not  intersected  to  obtain  a  good  mesh.  The  resulting  triangles  will 
respect  the  input,  they  will  be  well-shaped,  and  the  mesh  will  be  non-uniform. 
Unfortunately,  the  mesh  is  not  conforming.  We  can  remedy  this  by  taking  the 
subdivision  vertices  on  the  sides  of  the  square  into  account  when  we  triangulate 
it,  but  this  leads  to  another  problem:  if  a  square  has  many  vertices  on  a  side, 
then  not  all  triangles  we  get  are  well-shaped. 

To  avoid  these  problems  we  make  the  quadtree  subdivision  balanced  before 
we  triangulate  it.  Once  we  have  a  balanced  quadtree  subdivision,  we  can  easily 
generate  a  mesh  with  well-shaped  triangles  as  follows.  Squares  that  do  not  have 
a  vertex  in  the  interior  of  one  of  their  sides  (and  that  are  not  already  triangulated 
by  an  edge  of  one  of  the  components)  are  triangulated  by  adding  a  diagonal. 
Because  the  subdivision  is  balanced,  the  remaining  squares  have  at  most  one 
vertex  in  the  interior  of  each  side.  Moreover,  this  vertex  must  be  in  the  middle 
of  the  side.  Hence,  if  we  add  one  Steiner  point  in  the  center  of  such  a  square 
and  connect  it  to  all  vertices  on  the  boundary,  then  we  get  only  triangles  with 
two  45°  angles  and  one  90°  angle. 


Summarizing,  the  mesh  generation  algorithm  is  as  follows.  GenerateMesh 
constructs  the  mesh  in  the  form  of  a  doubly-connected  edge  list.  We  omit  the 
details  related  to  handling  the  doubly-connected  edge  list — in  particular,  how  to 
construct  the  doubly-connected  edge  list  corresponding  to  a  given  quadtree. 
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Algorithm  GenerateMesh(S)  Section  14.3 

Input.  A  set  S  of  components  inside  the  square  [0 :  U]  x  [0 :  U]  with  the  properties  from  quadtrees  to  meshes 
stated  at  the  beginning  of  this  section. 

Output.  A  triangular  mesh  M  that  is  conforming,  respects  the  input,  consists  of 
well-shaped  triangles,  and  is  non-uniform. 

1.  Construct  a  quadtree  T  on  the  set  S  inside  the  square  [0  :  U]  x  [0  :  U]  with 
the  following  stopping  criterion:  a  square  is  split  as  long  as  it  is  larger  than 
unit  size  and  its  closure  intersects  the  boundary  of  some  component. 

2.  T  <—  BalanceQuadTree(T) 

3.  Construct  the  doubly-connected  edge  list  for  the  quadtree  subdivision  M 
corresponding  to  T. 

4.  for  each  face  a  of  M 

5.  do  if  the  interior  of  cr  is  intersected  by  an  edge  of  a  component 

6.  then  Add  the  intersection  (which  is  a  diagonal)  as  an  edge  to  M. 

7.  else  if  cr  has  only  vertices  at  its  corners 

8.  then  Add  a  diagonal  of  cr  as  an  edge  to  M. 

9.  else  Add  a  Steiner  point  in  the  center  of  cr,  connect  it 

to  all  vertices  on  the  boundary  of  cr,  and  change  M 
accordingly. 

10.  return  M 


The  following  theorem  summarizes  the  properties  of  the  mesh  that  is  constructed 
by  our  algorithm. 

Theorem  14.5  LetS  be  a  set  of  disjoint  polygonal  components  inside  the  square 
[0  :  U]  x  [0  :  U]  with  the  properties  stated  in  the  beginning  of  this  section.  Then 
there  exists  an  non-uniform  triangular  mesh  for  this  input  that  is  conforming, 
respects  the  input,  and  has  only  well-shaped  triangles.  The  number  of  triangles 
is  0(p(S)  logf/),  where  p(S)  is  the  sum  of  the  perimeters  of  the  components  in 
S,  and  the  mesh  can  be  constructed  in  0(p(S)  log2  U)  time. 

Proof.  The  properties  of  the  mesh — that  it  is  non-uniform,  conforming,  that 
it  respects  the  input,  and  consists  of  well-shaped  triangles — follow  from  the 
discussion  above.  What  remains  is  to  bound  the  size  and  preprocessing  time  of 
the  mesh. 

The  mesh  construction  has  three  phases:  first  we  construct  a  quadtree 
subdivision,  then  we  make  it  balanced,  and  finally  we  triangulate  it. 

To  bound  the  size  of  the  quadtree  subdivision  resulting  from  the  first  phase 
we  use  the  following  observation.  Consider  the  grid  whose  cells  have  unit  size. 

Then  the  number  of  cells  whose  closure  is  intersected  by  a  line  segment  of  length 
/  is  at  most  4  +  3l/\/2.  It  follows  that  the  number  of  cells  whose  closure  is 
intersected  by  an  edge  of  any  of  the  components  is  0(p(S)).  Obviously  the  same 
is  true  for  any  grid  with  larger  cells.  Hence,  the  number  of  internal  nodes  of 
the  quadtree  at  a  fixed  depth  is  0(p(S)).  The  depth  of  the  quadtree  is  <9(logf/), 
since  we  stop  splitting  once  a  cell  has  unit  size.  The  total  number  of  nodes  of 
the  quadtree  and.  hence,  the  complexity  of  the  corresponding  subdivision,  is 
therefore  0(p(S)  logf/).  317 
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We  know  from  Theorem  14.4  that  making  a  quadtree  subdivision  balanced 
does  not  increase  its  complexity  asymptotically.  The  same  is  true  for  the  third 
phase  of  the  mesh  generation,  the  triangulation  of  the  squares  of  the  balanced 
subdivision.  (This  follows  because  each  cell  is  subdivided  into  a  constant 
number  of  triangles.)  We  conclude  that  the  number  of  triangles  of  the  final  mesh 
is  linear  in  the  complexity  of  the  quadtree  subdivision  resulting  from  the  first 
phase,  which  we  just  proved  to  be  0(p(S)\ogU). 

What  remains  is  to  prove  the  bound  on  the  construction  time.  The  first 
phase  is  dominated  by  the  time  needed  to  do  the  splitting  steps  in  the  recursive 
quadtree  construction  algorithm.  For  a  given  node  this  is  linear  in  the  number  of 
component  edges  intersecting  its  closure.  We  argued  above  that  the  sum  of  the 
number  of  intersecting  edges  over  all  nodes  at  a  fixed  depth  is  0(p(S)).  Hence, 
the  total  time  needed  in  the  first  phase  is  0(p(S)\ogU).  By  Theorem  14.4,  the 
time  needed  to  do  the  balancing  introduces  an  extra  0(\ogU)  factor.  Construct¬ 
ing  a  doubly-connected  edge  list  from  a  given  quadtree,  as  well  as  triangulating 
the  balanced  subdivision,  can  be  done  within  the  same  amount  of  time.  The 
bound  on  the  preprocessing  time  follows.  0 


14.4  Notes  and  Comments 

Quadtrees  were  one  of  the  first  data  structures  for  higher-dimensional  data.  They 
were  developed  by  Finkel  and  Bentley  in  1974  [177].  Since  then,  there  have 
been  hundreds  of  papers  dealing  with  quadtrees.  The  surveys  and  books  by 
Samet  [332,  333,  334,  335]  and  the  handbook  chapter  by  Aluru  [14]  give  an 
extensive  overview  of  the  various  types  of  quadtrees  and  their  applications. 

As  we  have  seen  in  this  chapter,  the  size  and  depth  of  a  quadtree  for  a  set 
of  n  points  cannot  be  bounded  in  terms  of  the  number  of  the  number  of  points 
only.  The  reason  is  that  there  can  be  many  nodes  with  only  one  non-empty 
subtree.  If  one  removes  such  nodes  from  the  tree,  then  its  size  becomes  linear. 
The  resulting  structure  is  called  a  compressed  quadtree — see  for  example  the 
survey  paper  by  Aluru  [14].  Combining  the  compressed  quadtree  with  ideas 
from  skip  lists,  Eppstein  et  al.  [173]  developed  a  quadtree  variant  that  has  not 
only  linear  size,  but  also  allows  insertions,  deletions,  and  search  operations  in 
O(\ogn)  time. 

Mesh  generation  is  but  one  application  of  quadtrees.  They  are  also  used  in  com¬ 
puter  graphics,  image  analysis,  geographic  information  systems,  and  many  other 
areas.  Quadtrees  are  often  used  to  answer  range  queries.  From  a  theoretical  point 
of  view  quadtrees  are  not  the  best  solution  to  range  searching  problems,  because 
usually  no  sublinear  bounds  on  the  query  time  can  be  proven.  Other  solutions 
to  various  range  searching  problems  are  described  in  Chapters  5,  10,  and  16. 
In  practice,  quadtrees  often  seem  to  perform  well.  Quadtrees  have  also  been 
applied  to  solve  tasks  like  hidden  surface  removal,  ray  tracing,  medial  axis 
transforms,  map  overlay  of  raster  maps,  and  nearest  neighbor  finding. 

Quadtrees  can  easily  be  generalized  to  higher  dimensions,  where  they  are 
usually  called  octrees. 
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The  mesh  generation  problem  is  important  in  many  areas  and  it  has  been  studied  Section  14.4 

extensively,  both  in  the  plane  and  in  3-dimensional  space.  The  surveys  by  Bern  notes  and  comments 

and  Eppstein  [62],  Bern  and  Plasman  [65],  Bern  [61],  and  Ho-Le  [215]  are  good 
starting  points  to  look  for  results  on  a  specific  setting  of  the  mesh  generation 
problem.  We  shall  briefly  describe  a  few  of  the  results. 

One  can  distinguish  structured  and  unstructured  meshes.  Structured  meshes 
are  usually  (deformed)  grids;  unstructured  meshes  are  often  triangulations  or 
quadrilateral  meshes.  We  shall  restrict  our  discussion  to  unstructured  triangu¬ 
lations.  Furthermore,  we  mainly  concentrate  on  the  case  where  the  domain 
to  be  meshed  is  2-dimensional  and  polygonal.  In  most  applications  there  are 
requirements  on  the  mesh  that  are  similar  to  the  ones  imposed  in  this  chapter. 

In  particular,  one  usually  requires  the  mesh  to  be  conforming — sometimes  the 
term  “consistent”  is  used — and  to  respect  the  input.  Furthermore,  some  sort 
of  well-shapedness  is  important.  This  usually  means  that  the  triangles  have  to 
satisfy  one  or  both  of  the  following  criteria:  (i)  There  should  be  no  small  angles, 
that  is,  every  angle  should  be  at  least  some  fixed  (not  too  small)  constant  9. 

This  constant  should  not  be  larger  than  the  smallest  angle  of  the  input  domain, 
because  we  cannot  avoid  the  input  angles  in  the  mesh,  (ii)  There  should  be 
no  obtuse  angles,  that  is,  no  angles  larger  than  90° .  In  the  example  studied  in 
this  chapter  we  wanted  the  triangles  to  satisfy  both  criteria,  and  we  then  took 
9  =  45°.  Often  the  goal  is  to  minimize  the  number  of  mesh  elements  under  the 
given  conditions.  This  implies  some  sort  of  non-uniformity:  small  triangles 
should  be  used  only  where  needed. 

We  first  consider  minimizing  the  number  of  triangles  in  the  mesh  under  the 
condition  that  there  be  no  small  angles.  The  number  of  triangles  that  we  obtain 
in  a  mesh  of  a  polygonal  domain  under  this  condition  not  only  depends  on  the 
number  of  vertices  of  the  domain,  it  also  depends  on  the  shape  of  the  domain. 

To  see  this,  we  introduce  a  parameter  that  is  closely  related  to  the  minimum 
angle  of  a  triangle,  namely  the  aspect  ratio  of  the  triangle.  This  is  the  ratio  of  the 
length  of  the  longest  side  of  the  triangle  to  the  height  of  the  triangle,  where  the 
height  of  a  triangle  is  the  Euclidean  distance  of  the  longest  edge  to  its  opposite 
vertex.  If  the  smallest  angle  of  a  triangle  is  9  then  the  aspect  ratio  is  between 
1  /  sin  9  and  2 /  sin  9.  Now  consider  a  rectangular  domain  whose  shorter  sides 
have  length  1  and  whose  longer  sides  have  length  A.  Suppose  we  require  that  the 
minimum  angle  be,  say,  30°.  This  implies  that  the  aspect  ratio  of  any  triangle  in 
the  mesh  must  be  less  than  or  equal  to  2/  sin  30°  =  4.  Furthermore,  the  height 
of  any  triangle  in  the  domain  is  at  most  one.  Hence,  the  area  of  any  triangle 
is  (9(1).  Because  the  total  area  of  the  rectangular  domain  is  A,  this  implies 
that  we  need  at  least  £2 (A)  triangles  in  the  mesh.  Bern  et  al.  [64]  describe  a 
method  based  on  quadtrees  that  produces  an  asymptotically  optimal  number 
of  triangles.  The  method  described  in  this  chapter  is  based  on  their  technique. 

Other  mesh-generation  algorithms  are  based  on  the  Delaunay  triangulation 
described  in  Chapter  9.  See  for  example  the  work  by  Shewchuck  [356,  357]  for 
a  discussion  of  Delaunay-based  algorithms  in  the  plane. 

If  the  only  requirement  is  that  the  triangles  in  the  mesh  are  non-obtuse  then  it 
turns  out  to  be  possible  to  construct  a  mesh  for  a  given  polygonal  domain  whose 
number  of  triangles  only  depends  on  the  number  of  vertices  of  the  domain. 
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Chapter  14  More  precisely,  Bern  and  Eppstein  [63]  have  shown  that  for  any  polygonal 
quadtrees  domain  with  n  vertices  there  is  a  mesh  consisting  of  0(n2)  non-obtuse  triangles. 
Bern  et  al.  [67]  improved  this  bound  to  0(n). 

Melissaratos  and  Souvaine  [278]  extended  the  approach  of  Bern  et  al.  [64] 
for  computing  a  mesh  without  small  angles  so  that  it  also  avoids  obtuse  triangles. 
The  number  of  triangles  in  the  mesh  is  still  at  most  a  constant  factor  from 
optimal. 

Minimizing  the  number  of  triangles  is  not  always  the  goal  of  meshing 
algorithms.  It  can  also  be  important  to  be  able  to  control  the  mesh  density, 
so  that  one  can  have  a  dense  mesh  in  interesting  areas  and  a  coarse  mesh  in 
uninteresting  areas.  This  is  the  setting  studied  by  Chew  [117].  He  describes 
a  meshing  algorithm  that  allows  the  user  to  define  a  function  that  determines 
whether  a  triangle  of  the  mesh  is  fine  enough.  The  angles  of  the  triangles 
produced  by  his  algorithm  are  between  30°  and  120°.  Another  nice  aspect  of 
his  work  is  that  the  algorithm  not  only  deals  with  planar  regions,  but  also  with 
regions  on  surface  patches. 

A  technique  called  edge  insertion  can  also  give  optimal  triangulations  for 
various  criteria  [66].  The  idea  is  to  improve  a  triangulation  in  steps  by  adding 
an  edge,  removing  the  intersected  edges,  and  retriangulating  the  polygonal 
regions  that  appear  optimally.  It  works  for  certain  ‘minmax’  type  of  criteria, 
like  minimizing  the  maximum  angle  in  the  triangulation,  and  minimizing  the 
maximum  slope  when  the  triangulation  represents  a  piecewise  linear  surface. 
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14.5  Exercises 

14.1  In  Figure  14.2  a  uniform  and  a  non-uniform  mesh  are  shown  for  a 
unit  square  in  the  top  left  corner  of  square  domain  of  size  16.  Consider 
similar  meshes  in  squares  of  larger  sizes  U  =  2-'  for  an  integer  j.  Express 
the  number  of  triangles  in  the  mesh  for  both  meshes  in  terms  of  j. 

14.2  Suppose  a  triangular  mesh  is  needed  inside  a  rectangle  whose  sides  have 
length  1  and  length  k  >  1 .  Steiner  points  may  not  be  used  on  the  sides, 
but  they  may  be  used  inside  the  rectangle.  Also  assume  that  all  triangles 
must  have  angles  between  30°  and  90°.  Is  it  always  possible  to  create  a 
triangular  mesh  with  these  properties?  Suppose  it  is  possible  to  create 
a  mesh  for  a  particular  input,  what  is  the  minimum  number  of  Steiner 
points  needed? 

14.3  All  triangles  produced  by  the  meshing  algorithm  of  this  chapter  are 
non-obtuse,  that  is,  they  do  not  have  angles  larger  than  90°.  Prove  that  if 
a  triangulation  of  a  set  P  of  points  in  the  plane  contains  only  non-obtuse 
triangles,  then  it  must  be  a  Delaunay  triangulation  of  P. 

14.4  Let  P  be  a  set  of  point  in  3-dimensional  space.  Describe  an  algorithm  to 
construct  an  octree  on  P.  (An  octree  is  the  3-dimensional  variant  of  the 
quadtree.) 


14.5  It  is  possible  to  reduce  the  size  of  a  quadtree  of  depth  d  for  a  set  of  Section  14.5 

points  (with  real  coordinates)  inside  a  square  from  0((d  +  1  )n)  to  0(n).  exercises 

The  idea  is  to  discard  any  node  v  that  has  only  one  child  under  which 

points  are  stored.  The  node  is  discarded  by  replacing  the  pointer  from 
the  parent  of  v  to  v  by  the  pointer  from  the  parent  to  the  only  interesting 
child  of  v.  Prove  that  the  resulting  tree  has  linear  size.  Can  you  also 
improve  upon  the  0((d  +  1  )n)  construction  time? 

14.6  In  this  chapter  we  called  a  quadtree  balanced  if  two  adjacent  squares  of 
the  quadtree  subdivision  differ  by  no  more  than  a  factor  two  in  size.  To 
save  a  constant  factor  in  the  number  of  extra  nodes  needed  to  balance  a 
quadtree,  we  could  weaken  the  balance  condition  by  allowing  adjacent 
squares  to  differ  by  a  factor  of  four  in  size.  Can  you  still  complete  such 
a  weakly  balanced  quadtree  subdivision  to  a  conforming  mesh  such  that 
all  angles  are  between  45°  and  90°  by  using  only  0(1)  triangles  per 
square? 

14.7  Suppose  we  make  the  balancing  condition  for  quadtrees  more  severe: 
we  no  longer  allow  adjacent  squares  to  differ  by  a  factor  two  in  size,  but 
we  require  them  to  have  exactly  the  same  size.  Is  the  number  of  nodes 
in  the  new  balanced  version  still  linear  in  the  number  of  nodes  of  the 
original  quadtree?  If  not,  can  you  say  anything  about  this  number? 

14.8  The  algorithm  to  construct  a  balanced  quadtree  had  two  phases:  a  normal 
quadtree  was  constructed,  which  was  then  balanced  in  a  postprocessing 
step.  It  is  also  possible  to  construct  a  balanced  quadtree  without  first 
constructing  the  unbalanced  version.  To  this  end  we  maintain  the  current 
quadtree  subdivision  in  a  doubly-connected  edge  list  during  the  quadtree 
construction,  and  whenever  we  split  a  square  we  check  whether  any 
neighbors  have  to  be  split.  Describe  the  algorithm  in  detail  and  analyze 
its  running  time. 

14.9  One  of  the  steps  in  Algorithm  GenerateMesh  is  to  construct  a  doubly- 
connected  edge  list  for  the  quadtree  subdivision  of  a  given  quadtree. 

Describe  an  algorithm  for  this  step,  and  analyze  its  running  time. 

14.10  A  quadtree  can  also  be  used  to  store  a  subdivision  for  efficient  point  lo¬ 
cation.  The  idea  is  to  keep  splitting  a  bounding  square  of  the  subdivision 
until  all  leaf  nodes  correspond  to  squares  that  contain  at  most  one  vertex 
and  only  edges  incident  to  that  vertex,  or  no  vertex  and  at  most  one  edge. 

a.  Since  a  vertex  can  be  incident  to  many  edges,  we  need  an  additional 
data  structure  at  the  quadtree  leaves  storing  vertices.  Which  data 
structure  would  you  use? 

b.  Describe  the  algorithm  for  constructing  the  point  location  data  struc¬ 
ture  in  detail,  and  analyze  its  running  time. 

c.  Describe  the  query  algorithm  in  detail,  and  analyze  its  running  time. 

14.1 1  Quadtrees  are  often  used  to  store  pixel  images.  In  this  case  the  initial 
square  is  exactly  the  size  of  the  image  (which  is  assumed  to  be  a  2k  x  2k 
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grid  for  some  integer  k).  A  square  is  split  into  subsquares  if  not  all  pixels 
inside  have  the  same  intensity. 

Prove  a  bound  on  the  complexity  of  the  quadtree  subdivision.  Hint:  This 
is  similar  to  the  bound  we  proved  on  the  size  of  the  quadtree  mesh. 

14.12  Suppose  we  have  quadtrees  on  pixel  images  I\  and  h  (see  the  previous 
exercise).  Both  images  have  size  2k  x  2k,  and  contain  only  two  intensi¬ 
ties,  0  and  1 .  Give  algorithms  for  Boolean  operations  on  these  images, 
that  is,  give  algorithms  to  compute  a  quadtree  for  7i  V  1 2  and  h  Ah. 
(Here  /,  V I2  is  the  2k  x  2k  image  where  pixel  (i,j)  has  intensity  1  if  and 
only  if  (i.j)  has  intensity  1  in  image  I\  or  in  image  h-  The  image  I\  Ah 
is  defined  similarly.) 

14.13  Quadtrees  can  be  used  to  perform  range  queries.  Describe  an  algorithm 
for  querying  a  quadtree  on  a  set  P  of  points  with  a  query  region  R. 
Analyze  the  worst-case  query  time  for  the  case  where  R  is  a  rectangle, 
and  for  the  case  where  R  is  a  half-plane  bounded  by  a  vertical  line. 

14.14  In  this  chapter  we  studied  quadtrees  that  store  a  set  of  point  in  the 
plane.  In  Chapter  5  we  studied  two  other  data  structures  for  storing 
sets  of  points  in  the  plane,  the  kd-tree  and  the  range  tree.  Discuss  the 
advantages  and  disadvantages  of  each  of  the  three  structures. 
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Finding  the  Shortest  Route 


In  Chapter  13  we  saw  how  to  plan  a  path  for  a  robot  from  a  given  start  position 
to  a  given  goal  position.  The  algorithm  we  gave  always  finds  a  path  if  it  exists, 
but  we  made  no  claims  about  the  quality  of  the  path:  it  could  make  a  large 
detour,  or  make  lots  of  unnecessary  turns.  In  practical  situations  we  would 
prefer  to  find  not  just  any  path,  but  a  good  path. 


Figure  15.1 
A  shortest  path 


What  constitutes  a  good  path  depends  on  the  robot.  In  general,  the  longer  a 
path,  the  more  time  it  will  take  the  robot  to  reach  its  goal  position.  For  a  mobile 
robot  on  a  factory  floor  this  means  it  can  transport  less  goods  per  time  unit, 
resulting  in  a  loss  of  productivity.  Therefore  we  would  prefer  a  short  path.  Often 
there  are  other  issues  that  play  a  role  as  well.  For  example,  some  robots  can  only 
move  in  a  straight  line;  they  have  to  slow  down,  stop,  and  rotate,  before  they 
can  start  moving  into  a  different  direction,  so  any  turn  along  the  path  causes 
some  delay.  For  this  type  of  robot  not  only  the  path  length  but  also  the  number 
of  turns  on  the  path  has  to  be  taken  into  account.  In  this  chapter  we  ignore 
this  aspect;  we  only  show  how  to  compute  the  Euclidean  shortest  path  for  a 
translating  planar  robot. 
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Chapter  15  15.1  Shortest  Paths  for  a  Point  Robot 

VISIBILITY  GRAPHS 

As  in  Chapter  13  we  first  consider  the  case  of  a  point  robot  moving  among  a 
set  S  of  disjoint  simple  polygons  in  the  plane.  The  polygons  in  S  are  called 
obstacles,  and  their  total  number  of  edges  is  denoted  by  n.  Obstacles  are  open 
sets,  so  the  robot  is  allowed  to  touch  them.  We  are  given  a  start  position  pstart 
and  a  goal  position  pgoai.  which  we  assume  are  in  the  free  space.  Our  goal  is  to 
compute  a  shortest  collision-free  path  from  pstart  to  pgoa],  that  is,  a  shortest  path 
that  does  not  intersect  the  interior  of  any  of  the  obstacles.  Notice  that  we  cannot 
say  the  shortest  path,  because  it  need  not  be  unique.  For  a  shortest  path  to  exist, 
it  is  important  that  obstacles  are  open  sets;  if  they  were  closed,  then  (unless  the 
robot  can  move  to  its  goal  in  a  straight  line)  a  shortest  path  would  not  exist,  as  it 
would  always  be  possible  to  shorten  a  path  by  moving  closer  to  an  obstacle. 


Let’s  quickly  review  the  method  from  Chapter  13.  We  computed  a  trapezoidal 
map  '.T(Ci  iee)  of  the  free  configuration  space  Cfree-  For  a  point  robot,  Cfree  was 
simply  the  empty  space  between  the  obstacles,  so  this  was  rather  easy.  The  key 
idea  was  then  to  replace  the  continuous  work  space,  where  there  are  infinitely 
many  paths,  by  a  discrete  road  map  Sroadi  where  there  are  only  finitely  many 
paths.  The  road  map  we  used  was  a  plane  graph  with  nodes  in  the  centers  of  the 
trapezoids  of  T(Cfree)  and  in  the  middle  of  the  vertical  extensions  that  separate 
adjacent  trapezoids.  The  nodes  in  the  center  of  each  trapezoid  were  connected 
to  the  nodes  on  its  boundary.  After  finding  the  trapezoids  containing  the  start 
and  goal  position  of  the  robot,  we  found  a  path  in  the  road  map  between  the 
nodes  in  the  centers  of  these  trapezoids  by  breadth-first  search. 


Figure  15.2 

The  shortest  path  does  not  follow  the 
road  map 


Because  we  used  breadth-first  search,  the  path  that  is  found  uses  a  minimum 
number  of  arcs  in  Sroad-  This  is  not  necessarily  a  short  path,  because  some  arcs 
are  between  nodes  that  are  far  apart,  whereas  others  are  between  nodes  that 
are  close  to  each  other.  An  obvious  improvement  would  be  to  give  each  arc 
a  weight  corresponding  to  the  Euclidean  length  of  the  segment  connecting  its 
incident  nodes,  and  to  use  a  graph  search  algorithm  that  finds  a  shortest  path  in 

_  a  weighted  graph,  such  as  Dijkstra’s  algorithm.  Although  this  may  improve  the 

324  path  length,  we  still  do  not  get  the  shortest  path.  This  is  illustrated  in  Figure  15.2: 


the  shortest  path  from  pstart  to  pgoai  following  the  road  map  passes  below  the  Section  15.1 

triangle,  but  the  real  shortest  path  passes  above  it.  What  we  need  is  a  different  shortest  paths  for  a  point  robot 

road  map,  one  which  guarantees  that  the  shortest  path  following  the  road  map  is 
the  real  shortest  path. 


Let’s  see  what  we  can  say  about  the  shape  of  a  shortest  path.  Consider  some 
path  from  pstart  to  pgoai-  Think  of  this  path  as  an  elastic  rubber  band,  whose 
endpoints  we  fix  at  the  start  and  goal  position  and  which  we  force  to  take  the 
shape  of  the  path.  At  the  moment  we  release  the  rubber  band,  it  will  try  to 
contract  and  become  as  short  as  possible,  but  it  will  be  stopped  by  the  obstacles. 
The  new  path  will  follow  parts  of  the  obstacle  boundaries  and  straight  line 
segments  through  open  space.  The  next  lemma  formulates  this  observation 
more  precisely.  It  uses  the  notion  of  an  inner  vertex  of  a  polygonal  path,  which 
is  a  vertex  that  is  not  the  begin-  or  endpoint  of  the  path. 

Lemma  15.1  Any  shortest  path  between  pshm  and  pgoai  among  a  set  S  of  disjoint 
polygonal  obstacles  is  a  polygonal  path  whose  inner  vertices  are  vertices  ofS. 

Proof.  Suppose  for  a  contradiction  that  a  shortest  path  T  is  not  polygonal.  Since 
the  obstacles  are  polygonal,  this  means  there  is  a  point  p  on  T  that  lies  in  the 
interior  of  the  free  space  with  the  property  that  no  line  segment  containing  p 
is  contained  in  T.  Since  p  is  in  the  interior  of  the  free  space,  there  is  a  disc  of 
positive  radius  centered  at  p  that  is  completely  contained  in  the  free  space.  But 
then  the  part  of  T  inside  the  disc,  which  is  not  a  straight  line  segment,  can  be 
shortened  by  replacing  it  with  the  segment  connecting  the  point  where  it  enters 
the  disc  to  the  point  where  it  leaves  the  disc.  This  contradicts  the  optimality  of  T, 
since  any  shortest  path  must  be  locally  shortest,  that  is,  any  subpath  connecting 
points  q  and  r  on  the  path  must  be  the  shortest  path  from  q  to  r. 

Now  consider  a  vertex  v  of  T.  It  cannot  lie  in  the  interior  of  the  free  space: 
then  there  would  be  a  disc  centered  at  p  that  is  completely  in  the  free  space, 
and  we  could  replace  the  subpath  of  T  inside  the  disc —  which  turns  at  v — by 
a  straight  line  segment  which  is  shorter.  Similarly,  v  cannot  lie  in  the  relative 
interior  of  an  obstacle  edge:  then  there  would  be  a  disc  centered  at  v  such  that 
half  of  the  disc  is  contained  in  the  free  space,  which  again  implies  that  we 
can  replace  the  subpath  inside  the  disc  with  a  straight  line  segment.  The  only 
possibility  left  is  that  v  is  an  obstacle  vertex.  0 

With  this  characterization  of  the  shortest  path,  we  can  construct  a  road  map 
that  allows  us  to  find  the  shortest  path.  This  road  map  is  the  visibility  graph 
of  S,  which  we  denote  by  Svjs(S).  Its  nodes  are  the  vertices  of  S,  and  there  is  an 
arc  between  vertices  v  and  vv  if  they  can  see  each  other,  that  is,  if  the  segment 
vw  does  not  intersect  the  interior  of  any  obstacle  in  S.  Two  vertices  that  can  see 
each  other  are  called  (mutually)  visible,  and  the  segment  connecting  them  is 
called  a  visibility  edge.  Note  that  endpoints  of  the  same  obstacle  edge  always 
see  each  other.  Hence,  the  obstacle  edges  form  a  subset  of  the  arcs  of  Svis(S)- 

By  Lemma  15.1  the  segments  on  a  shortest  path  are  visibility  edges,  except 
for  the  first  and  last  segment.  To  make  them  visibility  edges  as  well,  we  add 
the  start  and  goal  position  as  vertices  to  S,  that  is,  we  consider  the  visibility 


Pstart 
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graph  of  the  set  S*  :=  SU  {pstart,Pgoai}-  By  definition,  the  arcs  of  Svis(S*)  are 
between  vertices — which  now  include  psta rt  and  pgoai — that  can  see  each  other. 
We  get  the  following  corollary. 


Corollary  15.2  The  shortest  path  between  /Atari  and  pgoai  among  a  set  S  of 
disjoint  polygonal  obstacles  consists  of  arcs  of  the  visibility  graph  Svis(S*), 
where  S*  :=  SU  {pstait,Pgoai}- 

We  get  the  following  algorithm  to  compute  a  shortest  path  from  /?start  to  pgoai- 


Algorithm  SH0RTESTPATH(S, /Atari, pgoal) 

Input.  A  set  S  of  disjoint  polygonal  obstacles,  and  two  points  /?star t  and  pgoai  in 
the  free  space. 

Output.  The  shortest  collision-free  path  connecting  ps tart  and  pgoai- 

1.  Svis  <-  VISIBILITYGRAPH(5U  {pstartjPgoal}) 

2.  Assign  each  arc  (v,  w)  in  Svis  a  weight,  which  is  the  Euclidean  length  of 
the  segment  vw. 

3.  Use  Dijkstra’s  algorithm  to  compute  a  shortest  path  between  pstart  and 
/•'goal  hi  Svis- 

In  the  next  section  we  show  how  to  compute  the  visibility  graph  in  0(n2  logn) 
time,  where  n  is  the  total  number  of  obstacle  edges.  The  number  of  arcs  of 
Svis  is  of  course  bounded  by  (”+2).  Hence,  line  2  of  the  algorithm  takes  0{n2) 
time.  Dijkstra’s  algorithm  computes  the  shortest  path  between  two  nodes  in 
graph  with  k  arcs,  each  having  a  non-negative  weight,  in  0{n\ogn  +  k)  time. 
Since  k  =  O(tr),  we  conclude  that  the  total  running  time  of  ShortestPath  is 
0(n2  log n),  leading  to  the  following  theorem. 


Theorem  15.3  A  shortest  path  between  two  points  among  a  set  of  polygonal 
obstacles  with  n  edges  in  total  can  be  computed  in  0(n2  logn)  time. 


15.2  Computing  the  Visibility  Graph 

Let  S  be  a  set  of  disjoint  polygonal  obstacles  in  the  plane  with  n  edges  in 
total.  (Algorithm  ShortestPath  of  the  previous  section  needs  to  compute  the 
visibility  graph  of  the  set  S*,  which  includes  the  start  and  goal  position.  The 
presence  of  these  ‘isolated  vertices’  does  not  cause  any  problems  and  therefore 
we  do  not  explicitly  deal  with  them  in  this  section.  )  To  compute  the  visibility 
graph  of  S,  we  have  to  find  the  pairs  of  vertices  that  can  see  each  other.  This 
means  that  for  every  pair  we  have  to  test  whether  the  line  segment  connecting 
them  intersects  any  obstacle.  Such  a  test  would  cost  O(n)  time  when  done 
naively,  leading  to  an  0(«3)  running  time.  We  will  see  shortly  that  the  test  can 
be  done  more  efficiently  if  we  don’t  consider  the  pairs  in  arbitrary  order,  but 
concentrate  on  one  vertex  at  a  time  and  identify  all  vertices  visible  from  it,  as  in 
the  following  algorithm. 
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Algorithm  V  isibilityGraph(S) 

Input.  A  set  S  of  disjoint  polygonal  obstacles. 

Output.  The  visibility  graph  Svis(S). 

1.  Initialize  a  graph  S  =  ( V,  E )  where  V  is  the  set  of  all  vertices  of  the 
polygons  in  S  and  E  =  0. 

2.  for  all  vertices  v  GV 

3.  do  W  <—  VisibleVertices(v,S) 

4.  For  every  vertex  vv  GW,  add  the  arc  (v,  vv)  to  E. 

5.  return  9 

The  procedure  VISIBLE  VERTICES  has  as  input  a  set  S  of  polygonal  obstacles 
and  a  point  p  in  the  plane;  in  our  case  p  is  a  vertex  of  S ,  but  that  is  not  required. 
It  should  return  all  obstacle  vertices  visible  from  p. 

If  we  just  want  to  test  whether  one  specific  vertex  vv  is  visible  from  p,  there  is 
not  much  we  can  do:  we  have  to  check  the  segment  pw  against  all  obstacles. 
But  there  is  hope  if  we  want  to  test  all  vertices  of  S:  we  might  be  able  to  use 
the  information  we  get  when  we  test  one  vertex  to  speed  up  the  test  for  other 
vertices.  Now  consider  the  set  of  all  segments  pw.  What  would  be  a  good  order 
to  treat  them,  so  that  we  can  use  the  information  from  one  vertex  when  we  treat 
the  next  one?  The  logical  choice  is  the  cyclic  order  around  p.  So  what  we  will 
do  is  treat  the  vertices  in  cyclic  order,  meanwhile  maintaining  information  that 
will  help  us  to  decide  on  the  visibility  of  the  next  vertex  to  be  treated. 

A  vertex  w  is  visible  from  p  if  the  segment  pw  does  not  intersect  the  interior 
of  any  obstacle.  Consider  the  half-line  p  starting  at  p  and  passing  through  w. 
If  w  is  not  visible,  then  p  must  hit  an  obstacle  edge  before  it  reaches  w.  To 
check  this  we  perform  a  binary  search  with  the  vertex  w  on  the  obstacle  edges 
intersected  by  p.  This  way  we  can  find  out  whether  w  lies  behind  any  of  these 
edges,  as  seen  from  p.  (If  p  itself  is  also  an  obstacle  vertex,  then  there  is  another 
case  where  vv  is  not  visible,  namely  when  p  and  vv  are  vertices  of  the  same 
obstacle  and  pw  is  contained  in  the  interior  of  that  obstacle.  This  case  can  be 
checked  by  looking  at  the  edges  incident  to  vv,  to  see  whether  p  is  in  the  interior 
of  the  obstacle  before  it  reaches  vv.  For  the  moment  we  ignore  degenerate  cases, 
where  one  of  the  incident  edges  of  vv  is  contained  in  pw.) 


While  treating  the  vertices  in  cyclic  order  around  p  we  therefore  maintain 
the  obstacle  edges  intersected  by  p  in  a  balanced  search  tree  T.  (As  we  will  see 
later,  edges  that  are  contained  in  p  need  not  be  stored  in  T.)  The  leaves  of  T 
store  the  intersected  edges  in  order:  the  leftmost  leaf  stores  the  first  segment 


Section  15.2 

COMPUTING  THE  VISIBILITY  GRAPH 


Figure  15.3 

The  search  tree  on  the  intersected  edges 
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Chapter  15  intersected  by  p,  the  next  leaf  stores  the  segment  that  is  intersected  next,  and 

visibility  graphs  so  on.  The  interior  nodes,  which  guide  the  search  in  T,  also  store  edges.  More 

precisely,  an  interior  node  v  stores  the  rightmost  edge  in  its  left  subtree,  so  that 
all  edges  in  its  right  subtree  are  greater  (with  respect  to  the  order  along  p)  than 
this  segment  ev ,  and  all  segments  in  its  left  subtree  are  smaller  than  or  equal 
to  ev  (with  respect  to  the  order  along  p).  Figure  15.3  shows  an  example. 

Treating  the  vertices  in  cyclic  order  effectively  means  that  we  rotate  the 
half-line  p  around  p.  So  our  approach  is  similar  to  the  plane  sweep  paradigm  we 
used  at  various  other  places;  the  difference  is  that  instead  of  using  a  horizontal 
line  moving  downward  to  sweep  the  plane,  we  use  a  rotating  half-line. 

The  status  of  our  rotational  plane  sweep  is  the  ordered  sequence  of  obstacle 
edges  intersected  by  p.  It  is  maintained  in  T.  The  events  in  the  sweep  are  the 
vertices  of  S.  To  deal  with  a  vertex  w  we  have  to  decide  whether  vv  is  visible 
from  p  by  searching  in  the  status  structure  T,  and  we  have  to  update  7  by 
inserting  and/or  deleting  the  obstacle  edges  incident  to  w. 
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Algorithm  VisibleVertices  summarizes  our  rotational  plane  sweep.  The 
sweep  is  started  with  the  half-line  p  pointing  into  the  positive  x-di  recti  on  and 
proceeds  in  clockwise  direction.  So  the  algorithm  first  sorts  the  vertices  by  the 
clockwise  angle  that  the  segment  from  p  to  each  vertex  makes  with  the  positive 
x-axis.  What  do  we  do  if  this  angle  is  equal  for  two  or  more  vertices?  To  be  able 
to  decide  on  the  visibility  of  a  vertex  w,  we  need  to  know  whether  ~pw  intersects 
the  interior  of  any  obstacle.  Hence,  the  obvious  choice  is  to  treat  any  vertices 
that  may  lie  in  the  interior  of  pw  before  we  treat  w.  In  other  words,  vertices  for 
which  the  angle  is  the  same  are  treated  in  order  of  increasing  distance  to  p.  The 
algorithm  now  becomes  as  follows: 

Algorithm  Visible Vertices(p,  5) 

Input.  A  set  S  of  polygonal  obstacles  and  a  point  p  that  does  not  lie  in  the 
interior  of  any  obstacle. 

Output.  The  set  of  all  obstacle  vertices  visible  from  p. 

1 .  Sort  the  obstacle  vertices  according  to  the  clockwise  angle  that  the  half¬ 
line  from  p  to  each  vertex  makes  with  the  positive  x-axis.  In  case  of 
ties,  vertices  closer  to  p  should  come  before  vertices  farther  from  p.  Let 
w\ , . . . ,  w„  be  the  sorted  list. 

2.  Let  p  be  the  half-line  parallel  to  the  positive  x-axis  starting  at  p.  Find 
the  obstacle  edges  that  are  properly  intersected  by  p,  and  store  them  in  a 
balanced  search  tree  7  in  the  order  in  which  they  are  intersected  by  p. 

3.  W<-0 

4.  for  i  <—  1  to  n 

5.  do  if  Visible(w,)  then  Add  w,  to  W. 

6.  Insert  into  7  the  obstacle  edges  incident  to  w;  that  lie  on  the  clock¬ 
wise  side  of  the  half-line  from  p  to  vv,-. 

7.  Delete  from  T  the  obstacle  edges  incident  to  vv,  that  lie  on  the 
counterclockwise  side  of  the  half-line  from  p  to  vv,-. 

8.  return  W 


The  subroutine  VISIBLE  must  decide  whether  a  vertex  vv;  is  visible.  Normally, 
this  only  involves  searching  in  T  to  see  if  the  edge  closest  to  p ,  which  is  stored 
in  the  leftmost  leaf,  intersects  pw ].  But  we  have  to  be  careful  when  pwj  contains 
other  vertices.  Is  w,-  visible  or  not  in  such  a  case?  That  depends.  See  Figure  15.4 
for  some  of  the  cases  that  can  occur,  pw]  may  or  may  not  intersect  the  interior 
of  the  obstacles  incident  to  these  vertices.  It  seems  that  we  have  to  inspect 
all  edges  with  a  vertex  on  pWj  to  decide  if  vv,-  is  visible.  Fortunately  we  have 
already  inspected  them  while  treating  the  preceding  vertices  that  lie  on  pw,.  We 
can  therefore  decide  on  the  visibility  of  vv,-  as  follows.  If  w,_i  is  not  visible  then 
w,-  is  not  visible  either.  If  w,_i  is  visible  then  there  are  two  ways  in  which  w,- 
can  be  invisible.  Either  the  whole  segment  w]Z\W]  lies  in  an  obstacle  of  which 
both  Wj- 1  and  vv,-  are  vertices,  or  the  segment  vv,  |  vr,  is  intersected  by  an  edge 
in  T.  (Because  in  the  latter  case  this  edge  lies  between  w,_  i  and  vv,-,  it  must 
properly  intersect  w;_iw,-.)  This  test  is  correct  because  pwj  =  pw,-- 1  Uwj-iWj. 
(If  i=  1,  then  there  is  no  vertex  in  between  p  and  vv,-,  so  we  only  have  to  look  at 
the  segment  pw].)  We  get  the  following  subroutine: 


Visible^,) 

1 .  if  pwj  intersects  the  interior  of  the  obstacle  of  which  vv,-  is  a  vertex,  locally 

at  vv,- 

2.  then  return  false 

3.  else  if  i  =  1  or  w,_i  is  not  on  the  segment  pw] 

4.  then  Search  in  T  for  the  edge  e  in  the  leftmost  leaf. 

5.  if  e  exists  and  pw,-  intersects  e 

6.  then  return  false 

7.  else  return  true 

8.  else  if  w,-_ 1  is  not  visible 

9.  then  return  false 

10.  else  Search  in  T  for  an  edge  e  that  intersects  vv,  ivv,. 

11.  if  e  exists 

12.  then  return  false 

13.  else  return  true 


Section  15.2 

COMPUTING  the  visibility  graph 


Figure  15.4 

Some  examples  where  p  contains 
multiple  vertices.  In  all  these  cases 
w,_  i  is  visible.  In  the  left  two  cases  vv; 
is  also  visible  and  in  the  right  two  cases 
vv;  is  not  visible. 
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Chapter  15  This  finishes  the  description  of  the  algorithm  VISIBLE  VERTICES  to  compute 
visibility  graphs  the  vertices  visible  from  a  given  point  p. 

What  is  the  running  time  of  VisibleVertices?  The  time  we  spent  before 
line  4  is  dominated  by  the  time  to  sort  the  vertices  in  cyclic  order  around  p, 
which  is  O(nlogn).  Each  execution  of  the  loop  involves  a  constant  number 
of  operations  on  the  balanced  search  tree  T,  which  take  O(\ogn)  time,  plus  a 
constant  number  of  geometric  tests  that  take  constant  time.  Hence,  one  execution 
takes  O(k)gn)  time,  leading  to  an  overall  running  time  of  O(nlogn). 

Recall  that  we  have  to  apply  VisibleVertices  to  each  of  the  n  vertices  of  S 
in  order  to  compute  the  entire  visibility  graph.  We  get  the  following  theorem: 

Theorem  15.4  The  visibility  graph  of  a  set  S  of  disjoint  polygonal  obstacles 
with  n  edges  in  total  can  be  computed  in  0(n2  logn)  time. 


15.3  Shortest  Paths  for  a  Translating  Polygonal  Robot 

In  Chapter  13  we  have  seen  that  we  can  reduce  the  motion  planning  problem 
for  a  translating,  convex,  polygonal  robot  3?  to  the  case  of  a  point  robot  by 
computing  the  free  configuration  space  Cfree-  The  reduction  involves  computing 
the  Minkowski  sum  of  —3k,  a  reflected  copy  of  3k,  with  each  of  the  obstacles,  and 
taking  the  union  of  the  resulting  configuration-space  obstacles.  This  gives  us  a 

work  space  configuration  space  visibility  graph 


Figure  15.5 
Computing  a  shortest  path  for  a 
polygonal  robot 

set  of  disjoint  polygons,  whose  union  is  the  forbidden  configuration  space.  We 
can  then  compute  a  shortest  path  with  the  method  we  used  for  a  point  robot:  we 
extend  the  set  of  polygons  with  the  points  in  configuration  space  that  correspond 
to  the  start  and  goal  placement,  compute  the  visibility  graph  of  the  polygons, 
assign  each  arc  a  weight  which  is  the  Euclidean  length  of  the  corresponding 
visibility  edge,  and  find  a  shortest  path  in  the  visibility  graph  using  Dijkstra’s 
algorithm. 

To  what  running  time  does  this  approach  lead?  Lemma  13.13  states  that 
the  forbidden  space  can  be  computed  in  0(nlog2n)  time.  Furthermore,  the 
complexity  of  the  forbidden  space  is  0(n)  by  Theorem  13.12,  so  from  the 
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previous  section  we  know  that  the  visibility  graph  of  the  forbidden  space  can  be 
computed  in  0(n2  logn)  time. 

This  leads  to  the  following  result: 

Theorem  15.5  Let  31  be  a  convex,  constant-complexity  robot  that  can  translate 
among  a  set  of  polygonal  obstacles  with  n  edges  in  total.  A  shortest  collision- 
free  path  for  31  from  a  given  start  placement  to  a  given  goal  placement  can  be 
computed  in  0(n2  log  n)  time. 


15.4  Notes  and  Comments 

The  problem  of  computing  the  shortest  path  in  a  weighted  graph  has  been  studied 
extensively.  Dijkstra’s  algorithm  and  other  solutions  are  described  in  most  books 
on  graph  algorithms  and  in  many  books  on  algorithms  and  data  structures.  In 
Section  15.1  we  stated  that  Dijkstra’s  algorithm  runs  in  O(n\ogn  +  k)  time.  To 
achieve  this  time  bound,  one  has  to  use  Fibonacci  heaps  in  the  implementation. 
In  our  application  an  0((n  +  k)logn)  algorithm  would  also  do  fine,  since  the 
rest  of  the  algorithm  needs  that  much  time  anyway. 

The  geometric  version  of  the  shortest  path  problem  has  also  received  consid¬ 
erable  attention.  The  algorithm  given  here  is  due  to  Lee  [247].  More  efficient 
algorithms  based  on  arrangements  have  been  proposed;  they  run  in  0{n2) 
time  [23,  158,  383], 

Any  algorithm  that  computes  a  shortest  path  by  first  constructing  the  entire 
visibility  graph  is  doomed  to  have  at  least  quadratic  running  time  in  the  worst 
case,  because  the  visibility  graph  can  have  a  quadratic  number  of  edges.  For  a 
long  time  no  approaches  were  known  with  a  subquadratic  worst-case  running 
time.  Mitchell  [281]  was  the  first  to  break  the  quadratic  barrier:  he  showed  that 
the  shortest  path  for  a  point  robot  can  be  computed  in  (9(n5/3+e)  time.  Later  he 
improved  the  running  time  of  his  algorithm  to  0{ni/2+e)  [282].  In  the  mean 
time,  however,  Hershberger  and  Suri  [210,  212]  succeeded  in  developing  an 
optimal  0(n\ogn)  time  algorithm. 

In  the  special  case  where  the  free  space  of  the  robot  is  a  polygon  without 
holes,  a  shortest  path  can  be  computed  in  linear  time  by  combining  the  linear¬ 
time  triangulation  algorithm  of  Chazelle  [94]  with  the  shortest  path  method  of 
Guibas  et  al.  [195]. 

The  3-dimensional  version  of  the  Euclidean  shortest  path  problem  is  much 
harder.  This  is  due  to  the  fact  that  there  is  no  easy  way  to  discretize  the  problem: 
the  inflection  points  of  the  shortest  path  are  not  restricted  to  a  finite  set  of  points, 
but  they  can  lie  anywhere  on  the  obstacle  edges.  Canny  [80]  proved  that  the 
problem  of  computing  a  shortest  path  connecting  two  points  among  polyhedral 
obstacles  in  3-dimensional  space  is  NP-hard.  Reif  and  Storer  [327]  gave  a  single¬ 
exponential  algorithm  for  the  problem,  by  reducing  it  to  a  decision  problem 
in  the  theory  of  real  numbers.  There  are  also  several  papers  that  approximate 
the  shortest  path  in  polynomial  time,  for  example,  by  adding  points  on  obstacle 
edges  and  searching  a  graph  with  these  points  as  nodes  [13,  125,  126,  260,  316]. 


Section  15.4 
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Chapter  15 

VISIBILITY  GRAPHS 


In  this  chapter  we  concentrated  on  the  Euclidean  metric.  Various  papers  deal 
with  shortest  paths  under  a  different  metric.  Because  the  number  of  settings  is 
quite  large,  we  mention  only  a  few,  and  we  give  only  a  few  references  for  each 
setting.  An  interesting  metric  that  has  been  studied  is  the  link  metric,  where 
the  length  of  a  polygonal  path  is  the  number  of  links  it  consists  of  [20,  122, 
284,  367].  Another  case  that  has  been  studied  extensively  is  that  of  rectilinear 
paths.  Such  paths  play  an  important  role  in  VLSI  design,  for  instance.  Lee  et 
al.  [253]  give  a  survey  of  rectilinear  path  problems.  An  interesting  metric  that 
has  been  studied  for  rectilinear  paths  is  the  combined  metric,  which  is  a  linear 
combination  of  the  Euclidean  metric  and  the  link  metric  [56] .  Einally,  there 
are  papers  that  consider  paths  in  a  subdivision  where  each  region  has  a  weight 
associated  with  it.  The  cost  of  a  path  through  a  region  is  then  its  Euclidean 
length  times  the  weight  of  the  region.  Obstacles  can  be  modeled  by  regions 
with  infinite  weight  [113,  283]. 

While  there  are  many  obvious  metrics  for  translating  robots — in  particular, 
the  Euclidean  metric  immediately  comes  to  mind — it  is  not  so  easy  to  give  a 
good  definition  of  a  shortest  path  for  a  robot  that  can  rotate  as  well  as  trans¬ 
late.  Some  results  have  been  obtained  for  the  case  where  the  robot  is  a  line 
segment  [24,  114,  218], 

The  visibility  graph  was  introduced  for  planning  motions  by  Nilson  [295].  The 
0{n 2  log n)  time  algorithm  that  we  described  to  compute  it  is  due  to  Lee  [247].  A 
number  of  faster  algorithms  are  known  [23,  383],  including  an  optimal  algorithm 
by  Ghosh  and  Mount  [190],  which  runs  in  0(n\ogn  +  k)  time,  where  k  is  the 
number  of  arcs  of  the  visibility  graph. 

To  compute  a  shortest  path  for  a  point  robot  among  a  set  of  convex  polygonal 
obstacles,  one  does  not  need  all  the  visibility  edges.  One  only  needs  the  visibility 
edges  that  define  a  common  tangent.  Rohnert  [329]  gave  an  algorithm  that 
computes  this  reduced  visibility  graph  in  time  0(n  +  c2log«),  where  c  is  the 
number  of  obstacles,  and  n  is  their  total  number  of  edges. 

The  visibility  complex,  introduced  by  Vegter  and  Pocchiola  [319,  320,  376] 
is  a  structure  that  has  the  same  complexity  as  the  visibility  graph,  but  contains 
more  information.  It  is  defined  on  a  set  of  convex  (not  necessarily  polygonal) 
objects  in  the  plane,  and  can  be  used  for  shortest  path  problems  and  ray  shooting. 
It  can  be  computed  in  0{n  logn  +  k)  time.  Wein  et  al.  [382]  introduced  an 
interesting  variant  of  this,  the  visibility-Voronoi  complex,  which  combines  the 
visibility  complex  with  the  Voronoi  diagram  of  the  obstacles.  This  allows  one 
to  find  short  paths  that  do  not  come  too  close  to  the  obstacles. 


15.5  Exercises 

15.1  Let  S  be  a  set  of  disjoint  simple  polygons  in  the  plane  with  n  edges  in 
total.  Prove  that  for  any  start  and  goal  position  the  number  of  segments 
on  the  shortest  path  is  bounded  by  0(n).  Give  an  example  where  it 
is  ©(h). 
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15.2  Algorithm  VisibilityGraph  calls  algorithm  VisibleVertices  with 
each  obstacle  vertex.  VISIBLEVERTICES  sorts  all  vertices  around  its 
input  point.  This  means  that  n  cyclic  sortings  are  done,  one  around  each 
obstacle  vertex.  In  this  chapter  we  simply  did  every  sort  in  O(nlogn) 
time,  leading  to  0(n2  logn)  time  for  all  sortings.  Show  that  this  can  be 
improved  to  0{n2)  time  using  dualization  (see  Chapter  8).  Does  this 
improve  the  running  time  of  VisibilityGraph? 

15.3  The  algorithm  for  finding  the  shortest  path  can  be  extended  to  objects 
other  than  polygons.  Let  S  be  a  set  of  n  disjoint  disc-shaped  obstacles, 
not  necessarily  of  equal  radius. 

a.  Prove  that  in  this  case  the  shortest  path  between  two  points  not  seeing 
each  other  consists  of  parts  of  boundaries  of  the  discs,  and/or  common 
tangents  of  discs,  and/or  tangents  from  the  start  or  goal  point  to  the 
discs. 

b.  Adapt  the  notion  of  a  visibility  graph  for  this  situation. 

c.  Adapt  the  shortest  path  algorithm  to  find  the  shortest  path  between 
two  points  among  the  discs  in  S. 

15.4  What  is  the  maximal  number  of  shortest  paths  connecting  two  fixed 
points  among  a  set  of  n  triangles  in  the  plane? 

15.5  Let  S  be  a  set  of  disjoint  polygons  and  let  a  starting  point  psta rt  be  given. 
We  want  to  preprocess  the  set  S  (and  /;>start)  such  that  for  different  goal 
points  we  can  efficiently  find  the  shortest  path  from  pstart  to  the  goal. 
Describe  how  the  preprocessing  can  be  done  in  time  0(ir  logn)  such 
that  for  any  given  goal  point  pgoa\  we  can  compute  the  shortest  path  from 
P start  to  pgoai  in  time  O(nlogn). 

15.6  Design  an  algorithm  to  find  a  shortest  paths  between  two  points  inside  a 
simple  polygon.  Your  algorithm  should  run  in  subquadratic  time. 

15.7  When  all  obstacles  are  convex  polygons  we  can  improve  the  shortest 
path  algorithm  by  only  considering  common  tangents  rather  than  all 
visibility  edges. 

a.  Prove  that  the  only  visibility  edges  that  are  required  in  the  shortest 
path  algorithm  are  the  common  tangents  of  the  polygons. 

b.  Give  a  fast  algorithm  to  find  the  common  tangents  of  two  disjoint 
convex  polygons. 

c.  Give  an  algorithm  to  compute  those  common  tangents  that  are  also 
visibility  edges  among  a  set  of  convex  polygons. 

15.8*  If  you  are  familiar  with  homogeneous  coordinates,  it  is  interesting  to  see 
that  the  rotational  sweep  that  we  used  in  this  chapter  can  be  transformed 
into  a  normal  plane  sweep  using  a  horizontal  line  that  translates  over  the 
plane.  Show  that  this  is  the  case  using  a  projective  transformation  that 
moves  the  center  of  the  sweep  to  a  point  at  infinity. 
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16  Simplex  Range  Searching 

Windowing  Revisited 


In  Chapter  2  we  saw  that  geographic  information  systems  often  store  a  map  in  a 
number  of  separate  layers.  Each  layer  represents  a  theme  from  the  map,  that  is, 
a  specific  type  of  feature  such  as  roads  or  cities.  Distinguishing  layers  makes  it 
easy  for  the  user  to  concentrate  her  attention  on  a  specific  feature.  Sometimes 
one  is  not  interested  in  all  the  features  of  a  given  type,  but  only  in  the  ones  lying 
inside  a  certain  region.  Chapter  10  contains  an  example  of  this:  from  a  road 
map  of  the  whole  of  the  U.S.A.  we  wanted  to  select  the  part  lying  inside  a  much 
smaller  region.  There  the  query  region,  or  window,  was  rectangular,  but  it  is 
easy  to  imagine  situations  where  the  region  has  a  different  shape.  Suppose  that 


Figure  16.1 

Population  density  of  the  Netherlands 


we  have  a  map  layer  whose  theme  is  population  density.  The  density  is  shown 
on  the  map  by  plotting  a  point  for  every  5,000  people,  say.  An  example  of  such 
a  map  is  given  in  Figure  16.1.  If  we  want  to  estimate  the  impact  of  building,  say, 
a  new  airport  at  a  given  location,  it  is  useful  to  know  how  many  people  live  in 
the  affected  area.  In  geometric  terms  we  have  a  set  of  points  in  the  plane  and 
we  want  to  count  the  points  lying  inside  a  query  region  (for  instance,  the  region 
within  which  the  noise  of  planes  exceeds  a  certain  level). 

In  Chapter  5,  where  we  studied  database  queries,  we  developed  a  data 
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Chapter  16  structure  to  report  the  points  inside  an  axis-parallel  query  rectangle.  The  area 
simplex  range  searching  affected  by  the  airport,  however,  is  determined  by  the  dominating  direction  of 

the  wind,  and  unlikely  to  be  rectangular,  so  the  data  sttuctures  from  Chapter  5 
are  of  little  help  here.  We  need  to  develop  a  data  structure  that  can  handle  more 
general  query  ranges. 


16.1  Partition  Trees 
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Given  a  set  of  points  in  the  plane,  we  want  to  count  the  points  lying  inside  a 
query  region.  (Here  and  in  the  following  we  follow  the  convention  of  using  the 
expression  “to  count  the  points”  in  the  sense  of  “to  report  the  number  of  points”, 
not  in  the  sense  of  enumerating  the  points.)  Let’s  assume  that  the  query  region 
is  a  simple  polygon;  if  it  is  not,  we  can  always  approximate  it.  To  simplify 
the  query  answering  algorithm  we  first  triangulate  the  query  region,  that  is,  we 
decompose  it  into  triangles.  Chapter  3  describes  how  to  do  this.  After  we  have 
triangulated  the  region,  we  query  with  each  of  the  resulting  Uiangles.  The  set 
of  points  lying  inside  the  region  is  just  the  union  of  the  sets  of  points  inside 
the  triangles.  When  counting  the  points  we  have  to  be  a  bit  careful  with  points 
lying  on  the  common  boundary  of  two  Uiangles,  but  that  is  not  difficult  to  take 
care  of. 

We  have  arrived  at  the  triangular  range  searching  problem :  given  a  set  S  of  n 
points  in  the  plane,  count  the  points  from  S  lying  in  a  query  triangle  t.  Let’s 
first  study  a  slightly  simpler  version  of  this  problem,  where  the  query  triangle 
degenerates  into  a  half-plane.  What  should  a  data  structure  for  half-plane  range 
queries  look  like?  As  a  warm-up,  let’s  look  at  the  one-dimensional  problem 
first:  in  the  one-dimensional  version  of  the  problem  we  are  given  a  set  of  n 
points  on  the  real  line,  and  we  want  to  count  the  points  in  a  query  half-line 
(that  is,  the  points  lying  on  a  specified  side  of  a  query  point).  Using  a  balanced 
binary  search  tree  where  every  node  also  stores  the  number  of  points  in  its 
subtree,  we  can  answer  such  queries  in  (9 (log/;)  time.  How  can  we  generalize 
this  to  a  2-dimensional  setting?  To  answer  this  question  we  must  first  interpret 
a  balanced  search  tree  geometrically.  Each  node  of  the  tree  contains  a  key — the 
coordinate  of  a  point — that  is  used  to  split  the  point  set  into  the  sets  that  are 
stored  in  the  left  and  right  subtrees.  Similarly,  we  can  consider  this  key  value  to 
split  the  real  line  into  two  pieces.  In  this  way,  every  node  of  the  tree  corresponds 
to  a  region  on  the  line — the  root  to  the  whole  line,  the  two  children  of  the 
root  to  two  half-lines,  and  so  on.  For  any  query  half-line  and  any  node,  the 
region  of  one  child  of  the  node  is  either  completely  contained  in  the  half-line  or 
completely  disjoint  from  it.  All  points  in  that  region  are  in  the  half-line,  or  none 
of  them  is.  Hence,  we  only  have  to  search  recursively  in  the  other  subtree  of 
the  node.  This  is  shown  in  Figure  16.2.  The  points  are  drawn  below  the  tree  as 
black  dots;  the  two  regions  of  the  real  line  corresponding  to  the  two  subtrees  are 
also  indicated.  The  query  half-line  is  the  grey  region.  The  region  corresponding 
to  the  subtree  rooted  at  the  black  node  is  completely  inside  the  query  half-line. 
Hence,  we  only  have  to  recurse  on  the  right  subtree. 


recursively  visited  subtree 
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Figure  16.2 

Answering  a  half-line  query  with  a 
binary  tree 


To  generalize  this  to  two  dimensions  we  could  try  to  partition  the  plane 
into  two  regions,  such  that  for  any  query  half-plane  there  is  one  region  that 
is  either  completely  contained  in  the  half-plane  or  completely  disjoint  from 
it.  Unfortunately,  such  a  partitioning  does  not  exist,  so  we  need  a  further 
generalization:  instead  of  partitioning  into  two  regions,  we  must  partition  into 
more  regions.  The  partitioning  should  be  such  that  for  any  query  half-plane  we 
have  to  search  recursively  in  only  few  of  the  regions. 


We  now  give  a  formal  definition  of  the  type  of  partitioning  we  need.  A  sim- 
plicial  partition  for  a  set  S  of  n  points  in  the  plane  is  a  collection  'l1  (  .S')  := 
{(S1T1),. . . ,  (Sr,tr)},  where  the  5',’s  are  disjoint  subsets  of  S  whose  union  is  S , 
and  tj  is  a  triangle  containing  .S',.  The  subsets  .S',  are  called  classes.  We  do  not 
require  the  triangles  to  be  disjoint,  so  a  point  of  5  may  lie  in  more  than  one 
triangle.  Still,  such  a  point  is  a  member  of  only  one  class.  We  call  r,  the  number 
of  triangles,  the  size  of  'fb.S').  Figure  16.3  gives  an  example  of  a  simplicial 
partition  of  size  five;  different  shades  of  grey  are  used  to  indicate  the  different 
classes.  We  say  that  a  line  £  crosses  a  triangle  f,  if  £  intersects  the  interior 


Figure  16.3 

A  fine  simplicial  partition 


of  tj.  When  the  point  set  S  is  not  in  general  position  we  sometimes  need  to 
use  (relatively  open)  segments  as  well  as  triangles  in  the  simplicial  partition. 
A  line  is  defined  to  cross  a  segment  when  it  intersects  its  relative  interior  but 
does  not  contain  it.  The  crossing  number  of  a  line  £  with  respect  to  'lb. S')  is 
the  number  of  triangles  of  'lb  . S’)  crossed  by  £.  Thus  the  crossing  number  of  the 
line  £  in  Figure  16.3  is  two.  The  crossing  number  of  'lb' .S’)  is  the  maximum 
crossing  number  over  all  possible  lines  £.  In  Figure  16.3  you  can  find  lines 
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Chapter  16  that  intersect  four  triangles,  but  no  line  intersects  all  five.  Finally,  we  say  that 
simplex  range  searching  a  simplicial  partition  is  fine  if  |S,j  ^  2 n/r  for  every  1  ^  i  ^  r.  In  other  words. 

in  fine  simplicial  partitions  none  of  the  classes  contains  more  than  twice  the 
average  number  of  points  of  the  classes. 


Now  that  we  have  formalized  the  notion  of  partition,  let’s  see  how  we  can 
use  such  a  partition  to  answer  half-plane  range  queries.  Let  h  be  the  query 
half-plane.  If  a  triangle  of  the  partition  is  not  crossed  by  the  bounding  line  of 
h ,  then  its  class  .S',  either  lies  completely  in  h ,  or  it  is  completely  disjoint  from  h. 
This  means  we  only  have  to  recurse  on  the  classes  Sj  for  which  f,  is  crossed  by 
the  bounding  line  of  h.  For  example,  if  in  Figure  16.3  we  queried  with  f+,  the 
half-plane  lying  above  l,  we  would  have  to  recurse  on  two  of  the  five  classes. 
The  efficiency  of  the  query  answering  process  therefore  depends  on  the  crossing 
number  of  the  simplicial  partition:  the  lower  the  crossing  number,  the  better 
the  query  time.  The  following  theorem  states  that  it  is  always  possible  to  find  a 
simplicial  partition  with  crossing  number  0(y/r);  later  we  shall  see  what  this 
implies  for  the  query  time. 

Theorem  16.1  For  any  set  S  of  n  points  in  the  plane,  and  any  parameter  r  with 
1  ^  r  ^  n,  a  fine  simplicial  partition  of  size  r  and  crossing  number  0(  x/r) 
exists.  Moreover,  for  any  e  >  0  such  a  simplicial  partition  can  be  constructed  in 
time  0(n1+e). 

It  seems  a  bit  strange  to  claim  a  construction  time  of  0(nL1)  or  O(nL01)  or 
with  the  exponent  even  closer  to  1 .  Still,  no  matter  how  small  e  is,  as  long  as 
it  is  a  positive  constant,  the  bound  in  the  theorem  can  be  attained.  But  a  better 
upper  bound  like  O(n)  or  O(nlogn)  isn’t  claimed  in  the  theorem. 

Section  16.4  gives  pointers  to  the  literature  where  a  proof  of  this  theorem 
can  be  found.  We  shall  take  the  theorem  for  granted,  and  concentrate  on  how  to 
use  it  in  the  design  of  an  efficient  data  structure  for  half-plane  range  queries.  The 
data  structure  we’ll  obtain  is  called  a  partition  tree.  Probably  you  can  already 


Figure  16.4 
A  simplicial  partition  and  the 
corresponding  tree 


guess  what  such  a  partition  tree  looks  like:  it’s  a  tree  whose  root  has  r  children, 
each  being  the  root  of  a  recursively  defined  partition  tree  for  one  of  the  classes 
in  a  simplicial  partition.  There  is  no  specific  order  on  the  children;  it  happens  to 
be  irrelevant.  Figure  16.4  shows  a  simplicial  partition  and  the  corresponding 
tree.  The  dotted  triangles  form  the  partition  computed  recursively  for  the  class 

_  corresponding  to  the  middle  child  of  the  root;  the  five  “subclasses”  are  stored 
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store  some  extra  information  about  the  classes.  The  basic  structure  of  a  partition 
tree  is  thus  as  follows: 

■  If  S  contains  only  one  point,  p,  the  partition  tree  consists  of  a  single  leaf 
where  p  is  stored  explicitly.  The  set  S  is  the  canonical  subset  of  the  leaf. 

■  Otherwise,  the  structure  is  a  tree  7  of  branching  degree  r,  where  r  is  a 
sufficiently  large  constant.  (Below  we  shall  see  how  r  should  be  chosen.) 
The  children  of  the  root  of  the  tree  are  in  one-to-one  correspondence  with  the 
triangles  of  a  fine  simplicial  partition  of  size  r  for  the  set  S.  The  triangle  of 
the  partition  corresponding  to  child  V  is  denoted  by  t (v).  The  corresponding 
class  in  5  is  called  the  canonical  subset  of  v;  it  is  denoted  S( V).  The  child 
V  is  the  root  of  a  recursively  defined  partition  tree  Tv  on  the  set  S(v). 

■  With  each  child  v  we  store  the  triangle  t(v).  We  also  store  information 
about  the  subset  S(v);  for  half-plane  range  counting  this  information  is  the 
cardinality  of  S(v),  but  for  other  applications  we  may  want  to  store  other 
information. 

We  can  now  describe  the  query  algorithm  for  counting  the  number  of  points 
from  S  in  a  query  half-plane  h.  The  algorithm  returns  a  set  T  of  nodes  from  the 
partition  tree  7,  called  the  selected  nodes ,  such  that  the  subset  of  points  from 
S  lying  in  h  is  the  disjoint  union  of  the  canonical  subsets  of  the  nodes  in  T.  In 
other  words,  T  is  a  set  of  nodes  whose  canonical  subsets  are  disjoint,  and  such 
that 

snh  =  (Js(v). 

VGT 

The  selected  nodes  are  exactly  the  nodes  v  with  the  property:  f(v)  C  h  (or,  in 
case  v  is  a  leaf,  the  point  stored  at  v  lies  in  h)  and  there  is  no  ancestor  it  of  v 
such  that  t  (p)  C  h.  The  number  of  points  in  h  can  be  computed  by  summing 
the  cardinalities  of  the  selected  canonical  subsets. 

Algorithm  SelectInHalfplane(/z,  7) 

Input.  A  query  half-plane  h  and  a  partition  tree  or  subtree  of  it. 

Output.  A  set  of  canonical  nodes  for  all  points  in  the  tree  that  lie  in  h. 

1.  Y<-0 

2.  if  7  consists  of  a  single  leaf  p 

3.  then  if  the  point  stored  at  it  lies  in  h  then  Y  <—  {/i} 

4.  else  for  each  child  v  of  the  root  of  7 

5.  doiff(v)c/z 

6.  then  Y  <—  Y  U{v} 

7.  else  iff(v)n/z^0 

8.  thenY  <—  Y  U  SelectInHalfplane(/z,7v) 

9.  return  Y 
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Figure  16.5  illustrates  the  operation  of  the  query  algorithm.  The  selected  chil¬ 
dren  of  the  root  are  shown  in  black.  The  children  that  are  visited  recursively  (as 
well  as  the  root  itself,  since  it  has  also  been  visited)  are  grey.  As  said  before,  a 
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Chapter  16 

SIMPLEX  RANGE  SEARCHING 


Figure  16.5 

Answering  a  half-plane  range  query 
using  a  partition  tree 


half-plane  range  counting  query  can  be  answered  by  calling  SelectInHalf- 
PLANE  and  summing  the  cardinalities  of  the  selected  nodes,  which  are  stored 
at  the  nodes.  In  practice,  one  would  probably  not  keep  track  of  the  set  T,  but 
one  would  maintain  a  counter;  when  a  node  is  selected,  the  cardinality  of  its 
canonical  subset  is  added  the  counter. 

We  have  described  the  partition  tree,  a  data  structure  for  half-plane  range 
counting,  and  its  query  algorithm.  Now  it’s  time  to  analyze  our  structure.  We 
start  with  the  amount  of  storage. 

Lemma  16.2  Let  S  be  a  set  of  u  points  in  the  plane.  A  partition  tree  on  S  uses 
0(n)  storage. 

Proof.  Let  M(n)  be  the  maximum  number  of  nodes  that  a  partition  tree  on  a  set 
of  n  points  can  have,  and  let  n  v  denote  the  cardinality  of  the  canonical  subset 
S(v).  Then  M(n)  satisfies  the  following  recurrence: 

x  .  J  1  if  n  =  1, 

M{  ] ''  \  1  +£vM(nv)  if  n  >  1, 

where  we  sum  over  all  children  v  of  the  root  of  the  tree.  Because  the  classes  in 
a  simplicial  partition  are  disjoint,  we  have  £v«v  =  n.  Furthermore,  nv  f  2n / r 
for  all  v.  Hence,  for  any  constant  r  >  2  the  recurrence  solves  to  M in )  =  0(n). 

The  storage  needed  for  a  single  node  of  the  tree  is  O(r).  Since  r  is  a  constant, 
the  lemma  follows.  0 

Linear  storage  is  the  best  one  could  hope  for,  but  what  about  the  query  time? 

_  Here  it  becomes  important  what  the  exact  value  of  r  is.  When  we  perform  a 
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not  depend  on  r  or  n.  It  turns  out  that  this  constant  has  an  influence  on  the 
exponent  of  n  in  the  query  time.  To  decrease  this  influence,  we  need  to  make  r 
large  enough,  and  then,  as  we  will  see,  we  can  get  a  query  time  close  to  0(y/n). 

Lemma  16.3  Let  S  be  a  set  of  n  points  in  the  plane.  For  any  e  >  0,  there  is  a 
partition  tree  for  S  such  that  for  a  query  half-plane  h  we  can  select  0(n]  /2+e) 
nodes  from  the  tree  with  the  property  that  the  subset  of  points  from  S  in  h  is  the 
disjoint  union  of  the  canonical  subsets  of  the  selected  nodes.  The  selection  of 
these  nodes  takes  0(nl/2+£)  time.  As  a  consequence,  half-plane  range  counting 
queries  can  be  answered  in  0(«1/2+e)  time. 

Proof.  Let  e  >  0  be  given.  According  to  Theorem  16.1  there  is  a  constant  c 
such  that  for  any  parameter  r  we  can  construct  a  simplicial  partition  of  size  r 
with  crossing  number  at  most  C\Jr.  We  base  the  partition  tree  on  simplicial 
partitions  of  size  r  :=  \2{c\/2)xle^.  Let  Q(n)  denote  the  maximum  query  time 
for  any  query  in  a  tree  for  a  set  of  n  points.  Let  h  be  a  query  half-plane,  and  let 
nv  denote  the  cardinality  of  the  canonical  subset  S(v).  Then  Q(n)  satisfies  the 
following  recurrence: 

/  1  if  n  =  1, 

r  +  IveCW£(nv)  if  n  >  1, 

where  we  sum  over  the  set  C(h)  of  all  children  v  of  the  root  such  that  the 
boundary  of  h  crosses  f(v).  Because  the  simplicial  partition  underlying  the  data 
structure  has  crossing  number  C\fr,  we  know  that  the  number  of  nodes  in  the 
set  C(h)  is  at  most  c.\Jr.  We  also  know  that  nv  f  2 n  jr  for  each  v,  because  the 
simplicial  partition  is  fine.  Using  these  two  facts  one  can  show  that  with  our 
choice  of  r  the  recurrence  for  Q{n)  solves  to  0(n,,/2+£).  0 

You  may  be  a  bit  disappointed  by  the  query  time:  the  query  time  of  most 
geometric  data  structures  we  have  seen  up  to  now  is  O(logn)  or  a  polynomial  in 
logn,  whereas  the  query  time  for  the  partition  tree  is  around  O(yfi).  Apparently 
this  is  the  price  we  have  to  pay  if  we  want  to  solve  truly  2-dimensional  query 
problems,  such  as  half-plane  range  counting.  Is  it  impossible  to  answer  such 
queries  in  logarithmic  time?  No:  later  in  this  chapter  we  shall  design  a  data 
structure  for  half-plane  range  queries  with  logarithmic  query  time.  But  the 
improvement  in  query  time  will  not  come  for  free,  as  that  data  structure  will 
need  quadratic  storage. 

It  is  helpful  to  compare  the  approach  we  have  taken  here  with  the  range  trees 
from  Chapter  5  and  the  segment  trees  from  Chapter  10.  In  these  data  structures, 
we  would  like  to  return  information  about  a  subset  of  a  given  set  of  geometric 
objects  (points  in  range  trees  and  partition  trees,  intervals  in  segment  trees),  or 
to  report  the  subset  itself.  If  we  could  precompute  the  requested  information 
for  every  possible  subset  that  can  appear  in  a  query,  queries  could  be  answered 
very  fast.  However,  the  number  of  possible  different  answers  often  prohibits 
such  an  approach.  Instead,  we  identified  what  we  have  called  canonical  subsets, 
and  we  precomputed  the  required  information  for  these  subsets  only.  A  query  is 
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Chapter  16  then  solved  by  expressing  the  answer  to  the  query  as  the  disjoint  union  of  some 

simplex  range  searching  of  these  canonical  subsets.  The  query  time  is  roughly  linear  in  the  number  of 

canonical  subsets  that  are  required  to  express  any  possible  query  subset.  The 
storage  is  proportional  to  the  total  number  of  precomputed  canonical  subsets 
for  range  trees  and  partition  trees,  and  proportional  to  the  sum  of  the  sizes 
of  the  precomputed  canonical  subsets  for  segment  trees.  There  is  a  trade-off 
between  query  time  and  storage:  to  make  sure  that  every  possible  query  can 
be  expressed  as  the  union  of  only  a  few  canonical  subsets,  we  need  to  provide 
a  large  repertoire  of  such  subsets,  and  need  a  lot  of  storage.  To  decrease  the 
storage,  we  need  to  decrease  the  number  of  precomputed  canonical  subsets — but 
that  may  mean  that  the  number  of  canonical  subsets  needed  to  express  a  given 
query  will  be  larger,  and  the  query  time  increases. 

This  phenomenon  can  be  observed  clearly  for  2-dimensional  range  search¬ 
ing:  the  partition  tree  we  constructed  in  this  section  provides  a  repertoire  of 
only  0(n)  canonical  subsets,  and  needs  only  linear  storage,  but  in  general  one 
needs  canonical  subsets  to  express  the  set  of  points  lying  in  a  half-plane. 

Only  by  providing  roughly  a  quadratic  number  of  canonical  subsets  can  one 
achieve  logarithmic  query  time. 


Now  let’s  go  back  to  the  problem  that  we  wanted  to  solve,  namely  triangular 
range  queries.  Which  modifications  do  we  need  if  we  want  to  use  partition  trees 
for  triangles  instead  of  half-planes  as  query  regions?  The  answer  is  simple: 
none.  We  can  use  exactly  the  same  data  structure  and  query  algorithm,  with  the 
query  half-plane  replaced  by  a  query  triangle.  In  fact,  the  solution  works  for 
any  query  range  y.  The  only  question  is  what  happens  to  the  query  time. 

When  the  query  algorithm  visits  a  node,  there  are  three  types  of  children:  the 
children  v  for  which  t(v)  lies  completely  inside  the  query  range,  the  children  for 
which  f(v)  lies  outside  the  range,  and  the  children  for  which  f  (v)  lies  partially 
inside  the  query  range.  Only  the  children  of  the  third  type  have  to  be  visited 
recursively.  The  query  time  therefore  depends  on  the  number  of  triangles  in  the 
partition  that  are  crossed  by  the  boundary  of  the  query  range  y.  In  other  words, 
we  have  to  see  what  the  crossing  number  of  y  is  with  respect  to  the  simplicial 
partition.  For  a  triangular  query  region,  this  is  easy:  a  triangle  in  the  partition 
is  crossed  by  the  boundary  of  y  only  if  it  is  crossed  by  one  of  the  three  lines 
through  the  edges  of  y.  Since  each  one  of  these  lines  intersects  at  most  c^fr 
triangles,  the  crossing  number  of  y  is  at  most  3c\/r. 

The  recursion  for  the  query  time  therefore  remains  nearly  the  same,  only 
the  constant  c  changes  to  3c.  As  a  result,  we  will  need  to  choose  r  larger,  but  in 
the  end  the  query  time  remains  asymptotically  the  same.  We  get  the  following 
theorem: 

Theorem  16.4  Let  S  be  a  set  of  n  points  in  the  plane.  For  any  e  >  0,  there  is  a 
data  structure  for  S,  called  a  partition  tree,  that  uses  0(n)  storage,  such  that  the 
points  from  S  lying  inside  a  query  triangle  can  be  counted  in  0(nl'2+£)  time. 
The  points  can  be  reported  in  O(k)  additional  time,  where  k  is  the  number  of 
reported  points.  The  structure  can  be  constructed  in  0(n1+e)  time. 
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Proof.  The  only  two  issues  that  have  not  been  discussed  yet  are  the  construction  Section  16.2 
time  and  the  reporting  of  points.  multi-level  partition  trees 

Constructing  a  partition  tree  is  easy:  the  recursive  definition  given  before 
immediately  implies  a  recursive  construction  algorithm.  We  denote  the  time  this 
algorithm  needs  to  construct  a  partition  tree  for  a  set  of  n  points  by  T  (. n ).  Let 
e  >  0  be  given.  According  to  Theorem  16.1  we  can  construct  a  fine  simplicial 
partition  for  S  of  size  r  with  crossing  number  0(\fr)  in  time  0(n 1  1  f  ),  for  any 
s!  >  0.  We  let  s!  =  e/2.  Hence,  T(n)  satisfies  the  recurrence 


Tin)  = 


0(1)  if  n  =  1, 

0(n1+e/2)  +  £v  T  (nv)  ifn>l, 


where  we  sum  over  all  children  v  of  the  root  of  the  tree.  Because  the  classes  in 
a  simplicial  partition  are  disjoint,  we  have  Y.vnv  =  n,  and  the  recurrence  solves 
to  T(n)  =  0(nl+e). 

It  remains  to  show  that  the  k  points  in  a  query  triangle  can  be  reported  in 
0(k)  additional  time.  These  points  are  stored  in  the  leaves  below  the  selected 
nodes.  Hence,  they  can  be  reported  by  traversing  the  subtrees  rooted  at  the 
selected  nodes.  Because  the  number  of  internal  nodes  of  a  tree  is  linear  in  the 
number  of  leaves  of  that  tree  when  each  internal  node  has  degree  at  least  two, 
this  takes  time  linear  in  the  number  of  reported  points.  Li 
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Partition  trees  are  powerful  data  structures.  Their  strength  is  that  the  points 
lying  in  a  query  half-plane  can  be  selected  in  a  small  number  of  groups,  namely, 
the  canonical  subsets  of  the  nodes  selected  by  the  query  algorithm.  In  the 
example  above  we  used  a  partition  tree  for  half-plane  range  counting,  so  the 
only  information  that  we  needed  about  the  selected  canonical  subsets  was  their 
cardinality.  In  other  query  applications  we  will  need  other  information  about 
the  canonical  subsets,  so  we  have  to  precompute  and  store  that  data.  The 
information  that  we  store  about  a  canonical  subset  does  not  have  to  be  a  single 
number,  like  its  cardinality.  We  can  also  store  the  elements  of  the  canonical 
subset  in  a  list,  or  a  tree,  or  any  kind  of  data  structure  we  like.  This  way  we  get 
a  multi-level  data  structure.  The  concept  of  multi-level  data  structures  is  not 
new:  we  already  used  it  in  Chapter  5  to  answer  multi-dimensional  rectangular 
range  queries  and  in  Chapter  10  for  windowing  queries. 

We  now  give  an  example  of  a  multi-level  data  structure  based  on  partition  trees. 
Let  S  be  a  set  of  n  line  segments  in  the  plane.  We  want  to  count  the  number  of 
segments  intersected  by  a  query  line  £.  Let  bright  (s)  and  Pieft(s)  denote  the  right 
and  left  endpoint  of  a  segment  s,  respectively.  A  line  £  intersects  s  if  and  only  if 
either  the  endpoints  of  ,v  lie  to  distinct  sides  of  £,  or  ,v  has  an  endpoint  on  £.  We 
show  how  to  count  the  number  of  segments  s  £  S  with  /Uightf.v)  lying  above  £ 
and  pieft(s)  lying  below  £.  The  segments  with  an  endpoint  on  £,  and  the  ones 
with  p right  (s)  lying  below  £  and  £>ieft(s)  above  £,  can  be  counted  with  a  similar 


Chapter  16  data  structure.  Here  we  choose — for  a  vertical  line  ( — the  left  side  to  be  below 
simplex  range  searching  £  and  the  right  side  above  l. 

The  idea  of  the  data  structure  is  simple.  We  first  find  all  the  segments  s  £  S 
such  that  p right  (s)  lies  above  t.  In  the  previous  section  we  saw  how  to  use  a 
partition  tree  to  select  these  segments  in  a  number  of  canonical  subsets.  For  each 
of  the  selected  canonical  subsets  we  are  interested  in  the  number  of  segments  s 
with  pieft(s)  below  £.  This  is  a  half-plane  range  counting  query,  which  can  be 
answered  if  we  store  each  canonical  subset  in  a  partition  tree.  Let’s  describe 
this  solution  in  a  little  more  detail.  The  data  structure  is  defined  as  follows. 
For  a  set  S'  of  segments,  let  Fright^)  :=  {pright(s)  :  s  £  S1}  be  the  set  of  right 
endpoints  of  the  segments  in  S',  and  let  Pieft(S')  :=  {pieft(-s)  :  s  £  S1}  be  the  set 
of  left  endpoints  of  the  segments  in  S'. 

m  The  set  Fright  (S)  is  stored  in  a  partition  tree  T.  The  canonical  subset  of  a 
node  v  of  T  is  denoted  Fright(v).  The  set  of  segments  corresponding  to  the 
left  endpoints  in  Fright(v)  is  denoted  S(v),  that  is,  S(v)  =  {s  :  pright(s)  €E 
/nght ft')}.  (Abusing  the  terminology  slightly,  we  sometimes  call  S(v)  the 
canonical  subset  of  v.) 

■  With  each  node  v  of  the  first-level  tree  T,  we  store  the  set  F|eft(5'(v))  in 
a  second-level  partition  tree  TySSOC  for  half-plane  range  counting.  This 
partition  tree  is  the  associated  structure  of  v. 

With  this  data  structure  we  can  select  the  segments  s  £  S  with  bright  (•5)  above  £ 
and  pieft(s)  below  l  in  a  number  of  canonical  subsets.  The  query  algorithm  for 
this  is  described  below.  To  count  the  number  of  such  segments,  all  we  have  to 
do  is  sum  the  cardinalities  of  the  selected  subsets.  Let  7V  denote  the  subtree  of 
T  rooted  at  v. 

Algorithm  SelectIntSegments(£,‘T) 

Input.  A  query  line  t  and  a  partition  tree  or  subtree  of  it. 

Output.  A  set  of  canonical  nodes  for  all  segments  in  the  tree  that  are  intersected 
by  £. 

1.  T  <-0 

2.  if  7  consists  of  a  single  leaf  p 

3.  then  if  the  segment  stored  at  p  intersects  £  then  T  <—  {p} 

4.  else  for  each  child  v  of  the  root  of  T 

5.  doifr(v)cf+ 

6.  thenY  <—  Y  U  SELECTlNHALFPLANE(f~,TvSSOC) 

7.  else  ifr(v)nf^0 

8.  then  Y  <—  Y  U  SelectIntSegments(Y,Tv) 

9.  return  Y 

The  query  algorithm  just  given  can  find  the  segments  with  the  right  endpoint 
above  the  query  line  and  the  left  endpoint  below  it.  Interestingly,  the  same 
partition  tree  can  be  used  to  find  the  segments  with  the  left  endpoint  above  the 
query  line  and  the  right  endpoint  below  it.  Only  the  query  algorithm  has  to  be 
changed:  exchange  the  “£+”  and  the  “£~”  and  we  are  done. 
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Let’s  analyze  our  multi-level  partition  tree  for  segment  intersection  selection.  Section  16.2 
We  start  with  the  amount  of  storage.  multi-level  partition  trees 

Lemma  16.5  Let  S  be  a  set  of  n  segments  in  the  plane.  A  two-level  partition 
tree  for  segment  intersection  selection  queries  in  S  uses  0(n  log/;)  storage. 

Proof.  Let  z;v  denote  the  cardinality  of  the  canonical  subset  .S’ (  V j  in  the  first- 
level  partition  tree.  The  storage  for  this  node  consists  of  a  partition  tree  for  Sv , 
and  as  we  know  from  the  previous  section,  it  needs  linear  storage.  Hence,  the 
storage  M(n )  for  a  two-level  partition  tree  on  n  segments  satisfies  the  recurrence 

m(  1-  /  if  n  =  1, 

W  \  Lv[0(nv)+M(nv)}  if  zz  >  1, 

where  we  sum  over  all  children  v  of  the  root  of  the  tree.  We  know  that  £v  zz  v  =  zz 
and  zzv  ^  2zz/r.  Since  r  >  2  is  a  constant  the  recurrence  for  M(zz)  solves  to 
M(zz)  =  O(zzlogzz).  0 


Adding  a  second  level  to  the  partition  tree  has  increased  the  amount  of 
storage  by  a  logarithmic  factor.  What  about  the  query  time?  Surprisingly,  the 
asymptotic  query  time  does  not  change  at  all. 

Lemma  16.6  Let  S  be  a  set  of  zz  segments  in  the  plane.  For  any  e  >  0,  there 
is  a  two-level  partition  tree  for  S  such  that  for  a  query  line  £  we  can  select 
Ofzzl/2  '  £  )  nodes  from  the  tree  with  the  property  that  the  subset  of  segments 
from  S  intersected  by  £  is  the  disjoint  union  of  the  canonical  subsets  of  the 
selected  nodes.  The  selection  of  these  nodes  takes  0(zz1/2+£)  time.  As  a 
consequence,  the  number  of  intersected  segments  can  be  counted  in  0(n 1  /2f) 
time. 


Proof.  Again  we  use  a  recurrence  to  analyze  the  query  time.  Let  £  >  0  be  given. 
Let  zz v  denote  the  cardinality  of  the  canonical  subset  S(v).  Lemma  16.3  tells  us 
that  we  can  construct  the  associated  structure  Tf isoc  of  node  v  in  such  a  way  that 
the  query  time  in  Tfssoc  is  0(zzj/“+e).  Now  consider  the  full  two-level  tree  T  on 
S.  We  base  this  tree  on  a  fine  simplicial  partition  of  size  r  with  crossing  number 
at  most  c^fr,  for  r  :=  |"2(cv/2)1/£"| ;  such  a  partition  exists  by  Theorem  16.1.  Let 
Q(n)  denote  the  query  time  in  the  two-level  tree  for  a  set  of  zz  segments.  Then 
Q(n)  satisfies  the  recurrence: 


600 


0(1)  if  zz  =  1, 

0(rnl/2+e)  +L/^6(2zz/z")  if  zz  >  1 . 


With  our  choice  of  r  the  recurrence  for  Q(n)  solves  to  0(n]  1 '  £).  This  bound 
on  the  query  time  immediately  implies  the  bound  on  the  number  of  selected 
canonical  subsets.  0 
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Chapter  16  16.3  Cutting  Trees 

SIMPLEX  RANGE  SEARCHING 

In  the  previous  sections  we  have  solved  planar  range  searching  problems  with 
partition  trees.  The  storage  requirements  of  partition  trees  are  good:  they  use 
roughly  linear  storage.  The  query  time,  however,  is  0(nl  /2+£),  and  this  is  rather 
high.  Can  we  achieve  a  better  query  time,  for  example  O(logn),  if  we  are 
willing  to  spend  more  than  linear  storage?  To  have  any  hope  of  success,  we 
must  abandon  the  approach  of  using  simplicial  partitions:  it  is  not  possible  to 
construct  simplicial  partitions  with  less  than  0(\/r)  crossing  number,  which 
would  be  needed  to  achieve  a  query  time  faster  than  0(y/ri). 


Figure  16.6 

Half-plane  range  counting  in  the  dual 
plane:  how  may  lines  are  below  a 
query  point? 


To  come  up  with  a  new  approach  to  the  problem,  we  need  to  look  at  it  in 
a  different  light.  We  apply  the  duality  transform  from  Chapter  8.  The  first 
problem  we  solved  in  Section  16.1  was  the  half-plane  range  counting  problem: 
given  a  set  of  points,  count  the  number  of  points  lying  in  a  query  half-plane. 
Let’s  see  what  we  get  when  we  look  at  this  problem  in  the  dual  plane.  Assume 
that  the  query  half-plane  is  positive,  that  is,  we  want  to  count  the  points  above 
the  query  line.  In  the  dual  plane  we  then  have  the  following  setting:  given  a 
set  L  of  n  lines  in  the  plane,  count  the  number  of  lines  below  a  query  point  q. 
With  the  tools  we  constructed  in  the  previous  chapters  it  is  easy  to  design  a  data 
structure  with  logarithmic  query  time  for  this  problem:  the  key  observation  is 
that  the  number  of  lines  below  the  query  point  q  is  uniquely  determined  by  the 
face  of  the  arrangement  A(L)  that  contains  q.  Hence,  we  can  construct  A(L) 
and  preprocess  it  for  point  location  queries,  as  described  in  Chapter  6,  and  store 
with  each  face  the  number  of  lines  below  it.  Counting  the  number  of  lines  below 
a  query  point  now  boils  down  to  doing  point  location.  This  solution  uses  0(n2) 
storage  and  it  has  0(log«)  query  time. 

Note  that  this  was  a  situation  where  we  could  afford  to  precompute  the 
answer  for  every  possible  query — in  other  words,  the  collection  of  canonical 
subsets  consists  of  all  possible  subsets  that  can  appear.  But  if  we  go  to  triangular 
range  counting,  this  approach  is  not  so  good:  there  are  just  too  many  possible 
triangles  to  precompute  all  possible  answers.  Instead,  we’ll  try  to  express  the 
set  of  lines  below  a  query  point  by  a  small  number  of  canonical  subsets  in  a 

_  recursive  way.  We  can  then  use  the  multi-level  approach  from  the  previous 

346  section  to  solve  the  triangular  range  searching  problem. 


We  construct  the  whole  collection  of  canonical  subsets  using  a  data  structure  Section  16.3 
called  a  cutting  tree.  The  idea  behind  cutting  trees  is  the  same  as  for  partition  cutting  trees 
trees:  the  plane  is  partitioned  into  triangular  regions,  as  depicted  in  Figure  16.7. 

This  time,  however,  we  require  that  the  triangles  be  disjoint.  How  can  such 


Figure  16.7 

A  (1/2) -cutting  of  size  ten  for  a  set  of 
six  lines 


a  partitioning  help  to  count  the  number  of  lines  below  a  query  point?  Let 
L  :=  { T| ,  £2 -  -  -  - , }  be  the  set  of  lines  that  we  obtained  after  dualizing  the 
points  to  be  preprocessed  for  triangular  range  queries.  Consider  a  triangle  t  of 
the  partitioning,  and  a  line  that  does  not  intersect  t.  If  /:)■  lies  below  t,  then  £,■ 
lies  below  any  query  point  inside  t.  Similarly,  if  lies  above  t,  it  lies  above 
any  query  point  inside  t.  This  means  that  if  our  query  point  q  lies  in  t,  then 
the  only  lines  of  which  we  don’t  know  yet  whether  they  lie  above  or  below  q 
are  the  ones  that  intersect  t.  Our  data  structure  will  store  each  triangle  of  the 
partitioning,  with  a  counter  indicating  the  number  of  lines  below  it;  for  each 
triangle  we  also  have  a  recursively  defined  structure  on  the  lines  intersecting 
it.  To  query  in  this  structure,  we  first  determine  in  which  triangle  t  the  query 
point  q  falls.  We  then  compute  how  many  lines  from  the  ones  that  intersect  t  are 
below  q,  by  recursively  visiting  the  subtree  corresponding  to  t.  Finally,  we  add 
the  number  we  computed  in  the  recursive  call  to  the  number  of  lines  below  t. 
The  efficiency  of  this  approach  depends  on  the  number  of  lines  intersecting  a 
triangle:  the  smaller  this  number,  the  fewer  lines  on  which  we  have  to  recurse. 
We  now  formally  define  the  kind  of  partitioning  we  need. 


Let  L  be  a  set  of  n  lines  in  the  plane,  and  let  r  be  a  parameter  with  1  ^  r  ^  n.  A 
line  is  said  to  cross  a  triangle  if  it  intersects  the  interior  of  the  triangle.  A  (1  jr)- 
cutting  for  L  is  a  set  E(L)  :=  { /  | ,  f 2 ■  •  •  ■ ,  t,N }  of  possibly  unbounded  triangles 
with  disjoint  interiors  that  together  cover  the  plane,  with  the  property  that  no 
triangle  of  the  partitioning  is  crossed  by  more  than  n/r  lines  from  L.  The  size 

of  the  cutting  E(L)  is  the  number  of  triangles  it  consists  of.  Figure  16.7  gives  _ 

an  example  of  a  cutting.  347 
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SIMPLEX  RANGE  SEARCHING 


Figure  16.8 

The  canonical  subsets  and  the  crossing 
subset  for  a  triangle 


Theorem  16.7  For  any  set  L  of  n  lines  in  the  plane,  and  any  parameter  r  with 
1  ^  r  ^  n,  a  (1  / r) -cutting  of  size  0(r2)  exists.  Moreover,  such  a  cutting  (with 
for  each  triangle  in  the  cutting  the  subset  of  lines  from  L  that  cross  it)  can  be 
constructed  in  0(nr)  time. 

In  Section  16.4  references  are  given  to  the  papers  where  this  theorem  is 
proved.  We  shall  only  concern  ourselves  with  how  cuttings  can  be  used  to 
design  data  structures.  The  data  structure  based  on  cuttings  is  called  a  cutting 
tree.  The  basic  structure  of  a  cutting  tree  for  a  set  L  of  n  lines  is  as  follows. 

■  If  the  cardinality  of  L  is  one  then  the  cutting  tree  consists  of  a  single  leaf 
where  L  is  stored  explicitly.  The  set  L  is  the  canonical  subset  of  the  leaf. 

■  Otherwise,  the  structure  is  a  tree  T.  There  is  a  one-to-one  correspondence 
between  the  children  of  the  root  of  the  tree  and  the  triangles  of  a  (1/r)- 
cutting  E (L)  for  the  set  L,  where  r  is  a  sufficiently  large  constant.  (Below 
we  shall  see  how  r  should  be  chosen.  )  The  triangle  of  the  cutting  that 
corresponds  to  a  child  v  is  denoted  by  t(v).  The  subset  of  lines  in  L  that  lie 
below  f(v)  is  called  the  lower  canonical  subset  of  v;  it  is  denoted  L  (v). 
The  subset  of  lines  in  L  that  lie  above  t (v)  is  called  the  upper  canonical 
subset  of  v;  it  is  denoted  L+(v).  The  subset  of  lines  that  cross  t  (v)  is  called 
the  crossing  subset  of  r(v).  The  child  v  is  the  root  of  a  recursively  defined 
partition  tree  on  its  crossing  subset;  this  subtree  is  denoted  by  Tv. 

■  With  each  child  v  we  store  the  triangle  t (v).  We  also  store  information  about 
the  lower  and  upper  canonical  subsets  L_(v)  and  L+(v);  for  counting  the 
number  of  lines  below  the  query  point  we  only  need  to  store  the  cardinality 
of  the  set  L“  (v),  but  for  other  applications  we  may  store  other  information. 

Figure  16.8  illustrates  the  notions  of  lower  canonical  subset,  upper  canonical 
subset,  and  crossing  subset.  We  describe  an  algorithm  for  selecting  the  lines 


upper  canonical  subset 
crossing  subset 
lower  canonical  subset 


from  L  below  a  query  point  in  a  number  of  canonical  subsets.  To  count  the 
number  of  such  lines  we  have  to  sum  the  cardinalities  of  the  selected  canonical 
subsets.  Let  q  be  the  query  point.  The  set  of  selected  nodes  will  be  denoted 
byY. 
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Algorithm  SelectBelowPointG^T) 

Input.  A  query  point  q  and  a  cutting  tree  or  subtree  of  it. 

Output.  A  set  of  canonical  nodes  for  all  lines  in  the  tree  that  lie  below  q 

1.  T<-0 

2.  if  T  consists  of  a  single  leaf  /i 

3.  then  if  the  line  stored  at  u  lies  below  q  then  Y  <—  { Lt } 

4.  else  for  each  child  v  of  the  root  of  T 

5.  do  Check  if  q  lies  in  f(v). 

6.  Let  vq  be  the  child  such  that  q  £  t(vg). 

7.  T  <—  {vq}  U  SelectBelowPoint(^,Tv<?) 

8.  return  T 

Lemma  16.8  Let  L  be  a  set  of  n  lines  in  the  plane.  Using  a  cutting  tree,  the 
lines  from  L  below  a  query  point  can  be  selected  in  O(logn)  time  in  O(logn) 
canonical  subsets.  As  a  consequence,  the  number  of  such  lines  can  be  counted 
in  0( log  n)  time.  For  any  e  >  0,  a  cutting  tree  on  L  can  be  constructed  that  uses 
0(n2+e)  storage. 

Proof.  Let  Q(n)  denote  the  query  time  in  a  cutting  tree  for  a  set  of  n  lines.  Then 
Q(n)  satisfies  the  recurrence: 

f  0(1)  if  n  =  1, 

^[n>  \  0(r2)  +  Q(n/r)  ifn>l. 

This  recurrence  solves  to  Q(n)  =  O(logn)  for  any  constant  r  >  1. 

Let  e  >  0  be  given.  According  to  Theorem  16.7  we  can  construct  a  (1/r)- 
cutting  for  L  of  size  cr2,  where  c  is  a  constant.  We  construct  a  cutting  tree  based 
on  (1/r)  -cuttings  for  r  =  |~ (2c) 1  /8] .  The  amount  of  storage  used  by  the  cutting 
tree,  M(n),  satisfies 
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0(1)  ifn=l, 

0(r2)  +'LvM(tlv)  if  w  >  1, 


where  we  sum  over  all  children  v  of  the  root  of  the  tree.  The  number  of  children 
of  the  root  is  cr2,  and  nv  f  n/r  for  each  child  v.  Hence,  with  our  choice  of  r 
the  recurrence  solves  to  M{n)  =  0{n2+e).  0 


We  conclude  that  we  can  count  the  number  of  lines  below  a  query  point  in 
0(log«)  time  with  a  structure  that  uses  0{n2+e)  storage.  By  duality,  we  can  do 
half-plane  range  counting  within  the  same  bounds.  Now  let’s  look  at  triangular 
range  counting  again:  given  a  set  S  of  points  in  the  plane,  count  the  number  of 
points  inside  a  query  triangle.  Following  the  approach  for  half-plane  queries, 
we  go  to  the  dual  plane.  What  problem  do  we  get  in  the  dual  plane?  The  set  of 
points  dualizes  to  a  set  of  lines,  of  course,  but  it  is  less  clear  what  happens  to  the 
query  triangle.  A  triangle  is  the  intersection  of  three  half-planes,  so  a  point  p  lies 
in  a  triangle  if  and  only  if  it  lies  in  each  of  the  half-planes.  In  Figure  16.9,  for 
instance,  point  p  lies  in  the  triangle  because  p  £  and  p  £  C\fcf  and  p  £  Off . 
The  line  dual  to  p  therefore  has  l\  *  above  it,  and  £2*  and  £3*  below  it.  In  general, 
the  dual  statement  of  the  triangular  range  searching  problem  is:  given  a  set  L 
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Figure  16.9 
Triangular  range  searching 


primal  plane 


1 2  ^3 


dual  plane 


of  lines  in  the  plane,  and  a  triple  <7 1,172,  <73  of  query  points  labeled  “above”  or 
“below”,  count  the  number  of  lines  from  L  that  lie  on  the  specibed  sides  of  the 
three  query  points.  This  problem  can  be  solved  with  a  three-level  cutting  tree. 
We  now  describe  a  data  structure  for  the  following,  slightly  simpler,  problem: 
given  a  set  L  of  lines,  and  a  pair  q\ ,  qi  of  query  points,  select  the  lines  that  lie 
below  both  query  points.  After  having  seen  the  two-level  cutting  tree  that  solves 
this  problem,  designing  a  three-level  cutting  tree  for  the  dual  of  the  triangular 
range  searching  problem  should  be  easy. 

A  two-level  cutting  tree  on  a  set  L  of  n  lines  for  selecting  the  lines  below  a 
pair  q\,q2  of  query  points  is  defined  as  follows. 

■  The  set  L  is  stored  in  a  cutting  tree  7. 

m  With  each  node  v  of  the  first-level  tree  T,  we  store  its  lower  canonical  subset 
in  a  second-level  cutting  tree  7ySSOC. 

The  idea  is  that  the  first  level  of  the  tree  is  used  to  select  the  lines  below  q\  in  a 
number  of  canonical  subsets.  The  associated  structures  (or,  trees  at  the  second 
level)  storing  the  selected  canonical  subsets  are  then  used  to  select  the  lines 
that  lie  below  q2 .  Because  the  associated  structures  are  one-level  cuttings  trees, 
we  can  use  algorithm  SelectBelowPoint  to  query  them.  The  total  query 
algorithm  is  thus  as  follows. 

Algorithm  SelectBelowPair(^i,  qi-,7) 

Input.  Two  query  points  q\  and  q2  and  a  cutting  tree  or  subtree  of  it. 

Output.  A  set  of  canonical  nodes  for  all  lines  in  the  tree  that  lie  below  q\  and  <72- 

1.  Y  <—  0 

2.  if  7  consists  of  a  single  leaf  p 

3.  then  if  the  line  stored  at  p  lies  below  q i  and  q2  then  T  <—  {p} 

4.  else  for  each  child  v  of  the  root  of  T 

5.  do  Check  if  q\  lies  in  f(v). 

6.  Let  vqi  be  the  child  such  that  <71  £  t(vqi). 

1.  Yi  <—  SelectBelowPoint^,^00) 

8.  T2  <—  SelectBelowPair(<7i,<72,Tv91) 

9.  Y4-Y1UY2 

10.  return  Y 

Recall  that  adding  an  extra  level  to  a  partition  tree  did  not  increase  its  query 

_  time,  whereas  the  amount  of  storage  it  used  increased  by  a  logarithmic  factor. 
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increases  the  query  time  by  a  logarithmic  factor,  whereas  the  amount  of  storage 
stays  the  same.  This  is  proved  in  the  next  lemma. 

Lemma  16.9  Let  L  be  a  set  of  n  lines  in  the  plane.  Using  a  two-level  cutting 
tree,  the  lines  from  L  below  a  pair  of  query  points  can  be  selected  in  0( log2  n) 
time  in  O(log2n)  canonical  subsets.  As  a  consequence,  the  number  of  such 
lines  can  be  counted  in  (9(log2  n)  time.  For  any  e  >  0,  such  a  two-level  cutting 
tree  on  L  can  be  constructed  that  uses  0(n2+e)  storage. 

Proof.  Let  Q(n)  denote  the  query  time  in  a  two-level  cutting  tree  for  a  set  of  n 
lines.  The  associated  structures  are  one-level  cutting  trees,  so  the  query  time  for 
the  associated  structures  is  O(logn)  by  Lemma  16.8.  Hence,  Q(n)  satisfies  the 
recurrence: 


f  0(1)  if  n  =  1, 

11  (  0(r2)  +  <9(logn)  +  Q(n/r)  if  n  >  1. 

This  recurrence  solves  to  Q(n)  =  Of  log2  n)  for  any  constant  r  >  1. 

Let  e  >  0  be  given.  According  to  Lemma  16.8  we  can  construct  the  associ¬ 
ated  structures  of  the  children  of  the  root  such  that  each  of  them  uses  0(«2+e) 
storage.  Hence,  the  amount  of  storage  used  by  the  cutting  tree,  M(n),  satisfies 

m(  1-/  °0)  if n  =  1, 

W  1  Lv\0(n2+£  +  M(nv)\  if  n  >  1, 

where  we  sum  over  all  children  v  of  the  root  of  the  tree.  The  number  of  children 
of  the  root  is  0(r2),  and  nv  f  n/r  for  each  child  V.  It  follows  that,  if  r  is  a  large 
enough  constant,  the  recurrence  solves  to  M{n)  =  0(n2+e).  (If  you  are  a  bit 
bored  by  now,  you  are  on  the  right  track:  cutting  trees,  partition  trees,  and  their 
multi-level  variants  are  all  analyzed  in  the  same  way.)  0 


We  designed  and  analyzed  a  two-level  cutting  tree  for  selecting  (or  counting) 
the  number  of  lines  below  a  pair  of  query  points.  For  the  triangular  range 
searching  we  need  a  three-level  cutting  tree.  The  design  and  analysis  of  three- 
level  cutting  trees  follows  exactly  the  same  pattern  as  for  two-level  cutting  trees. 
Therefore  you  should  hopefully  have  no  difficulties  in  proving  the  following 
result. 

Theorem  16.10  Let  S  be  a  set  ofn  points  in  the  plane.  For  any  e  >  0,  there  is  a 
data  structure  for  S,  called  a  cutting  tree,  that  uses  0(n2+e)  storage  such  that 
the  points  from  S  lying  inside  a  query  triangle  can  be  counted  in  0( log3  n)  time. 
The  points  can  be  reported  in  O(k)  additional  time,  where  k  is  the  number  of 
reported  points.  The  structure  can  be  constructed  in  0(n2+e)  time. 

One  can  even  do  a  little  bit  better  than  in  this  theorem.  This  is  discussed  in 
Section  16.4  and  in  the  exercises. 
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Chapter  16  16.4  Notes  and  Comments 

SIMPLEX  RANGE  SEARCHING 

Range  searching  is  one  of  the  best  studied  problems  in  computational  geometry. 
For  extensive  overviews  of  results  on  range  searching,  see  the  surveys  by  Agar- 
wal  [1]  and  Agarwal  and  Erickson  [2].  We  can  distinguish  between  orthogonal 
range  searching  and  simplex  range  searching.  Orthogonal  range  searching  was 
the  topic  of  Chapter  5.  In  this  chapter  we  discussed  the  planar  variant  of  simplex 
range  searching,  namely  triangular  range  searching.  We  conclude  with  a  brief 
overview  of  the  history  of  simplex  range  searching,  and  a  discussion  of  the 
higher-dimensional  variants  of  the  theory  we  presented. 

We  begin  our  discussion  with  data  structures  for  simplex  range  searching  in 
the  plane  that  use  roughly  linear  storage.  Willard  [388]  was  the  first  to  present 
such  a  data  structure.  His  structure  is  based  on  the  same  idea  as  the  partition 
trees  described  in  this  chapter,  namely  a  partition  of  the  plane  into  regions.  His 
partition,  however,  did  not  have  such  a  good  crossing  number,  so  the  query  time 
of  his  structure  was  0(n°'774).  As  better  simplicial  partitions  were  developed, 
more  efficient  partition  trees  were  possible  [111,  169,  209,  394].  Improvements 
were  also  obtained  using  a  somewhat  different  structure  than  a  partition  tree, 
namely  a  spanning  tree  with  low  stabbing  number  [112,  384].  The  best  solution 
for  triangular  range  searching  so  far  has  been  given  by  Matousek  [263].  Theo¬ 
rem  16.1  is  proved  in  that  paper.  Matousek  also  describes  a  more  complicated 
data  structure  with  (9(v/n20(log*"')  query  time.  This  structure,  however,  cannot 
be  used  so  easily  as  a  basis  for  multi-level  trees. 

The  simplex  range  searching  problem  in  Kf/  is  stated  as  follows:  preprocess 
a  set  S  of  points  in  R(/  into  a  data  structure,  such  that  the  points  from  S  lying 
in  a  query  simplex  can  be  counted  (or  reported)  efficiently.  Matousek  also 
proved  results  for  simplicial  partitions  in  higher  dimensions.  The  definition 
of  simplicial  partitions  in  is  similar  to  the  definition  in  the  plane;  the  only 
difference  is  that  the  triangles  of  the  partition  are  replaced  by  cZ-simplices,  and 
that  the  crossing  number  is  defined  with  respect  to  hyperplanes  instead  of  lines. 
Matousek  proved  that  any  set  of  points  in  R7  admits  a  simplicial  partition  of 
size  r  with  crossing  number  Ot'r1  1  '/f/) .  Using  such  a  simplicial  partition  one 
can  construct,  for  any  e  >  0,  a  partition  tree  for  simplex  range  searching  in  W1 
that  uses  linear  space  and  has  0(nl  query  time.  The  query  time  can 

be  improved  to  (9(n1_1  / d (log«)°^) .  The  query  time  of  Matousek’s  structure 
comes  close  to  the  lower  bounds  proved  by  Chazelle  [89],  which  state  that  a 
data  structure  for  triangular  range  searching  that  uses  0(m)  storage  must  have 
£l(n/ (rri'id lognj)  query  time.  A  structure  that  uses  linear  space  must  thus  have 
f2(n1-1/d/logn)  query  time.  (In  the  plane,  a  slightly  sharper  lower  bound  is 
known,  namely  £l(n/^Jm)) 

Data  structures  for  simplex  range  searching  with  logarithmic  query  time 
have  also  received  a  lot  of  attention.  Clarkson  [131]  was  the  first  to  realize 
that  cuttings  can  be  used  as  the  basis  for  a  data  structure  for  range  searching. 
Using  a  probabilistic  argument,  he  proved  the  existence  of  0  (log  r/r) -cuttings 
of  size  0(ijl )  for  sets  of  hyperplanes  in  M.d,  and  he  used  this  to  develop  a  data 
structure  for  half-space  range  queries.  After  this,  several  people  worked  on 
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improving  the  results  and  on  developing  efficient  algorithms  for  computing  Section  16.5 

cuttings.  Currently  the  best  known  algorithm  is  by  Chazelle  [95].  He  has  shown  exercises 

that  for  any  parameter  r,  it  is  possible  to  compute  a  (l/r)-cutting  of  size  0(rd) 

with  a  deterministic  algorithm  that  takes  0(nrd_1)  time.  These  cuttings  can 

be  used  to  design  a  (multi-level)  cutting  tree  for  simplex  range  searching,  as 

shown  in  this  chapter  for  the  planar  case.  The  resulting  data  structure  has 

O(\ogd  n)  query  time  and  uses  0(nd+e)  storage.  The  query  time  can  be  reduced 

to  O(\ogn).  Due  to  a  special  property  of  Chazelle’s  cuttings,  it  is  also  possible 

to  get  rid  of  the  0(ne)  factor  in  the  storage  [265],  but  for  the  new  structure  it 

is  no  longer  possible  to  reduce  the  query  time  to  O(\ogn).  These  bounds  are 

again  close  to  Chazelle’s  lower  bounds. 

By  combining  partition  trees  and  cutting  trees  the  right  way,  one  can  get 
data  structures  that  have  storage  in  between  that  of  partition  trees  and  cutting 
trees.  In  particular,  for  any  n  ^  m  nd ,  a  data  structure  of  size  0(ml  e)  has 
0(n1+e /ml/d)  query  time,  close  to  the  lower  bound:  Exercise  16.16  shows  how 
to  do  this. 

Partition  trees  use  linear  space,  but  their  query  time  is  rather  high.  Cutting 
trees,  on  the  other  hand,  have  logarithmic  query  time  but  they  need  a  lot  of 
storage.  Ideally  one  would  like  to  have  a  structure  that  uses  linear  space  and 
has  logarithmic  query  time.  While  Chazelle’s  lower  bounds  [89]  show  that 
this  is  not  possible  for  exact  range  searching,  one  can  achieve  such  bounds  for 
approximate  range  searching.  The  idea  here  is  that  points  that  are  “almost”  in 
the  query  range  (that  is,  that  are  very  close  to  it)  may  be  reported  as  well — see 
the  survey  by  Duncan  and  Goodrich  [151]  for  details. 


In  the  discussion  above  we  have  concentrated  on  simplex  range  searching.  Half¬ 
space  range  searching  is,  of  course,  a  special  case  of  this.  It  turns  out  that  for 
this  special  case  better  results  can  be  achieved.  For  example,  for  half-plane 
range  reporting  (not  for  counting)  in  the  plane,  there  is  a  data  structure  with 
O(logn  +  k)  query  time  that  uses  0[n)  storage  [107].  Here  k  is  the  number  of 
reported  points.  Improved  results  are  possible  in  higher  dimensions  as  well: 
the  points  in  a  query  half-space  can  be  reported  in  0(m1~1/L‘V2J  (logn)°0)  +  £) 
time  with  a  data  structure  that  uses  D(nloglog«)  storage  [264]. 

Finally,  Agarwal  and  Matousek  [8]  generalized  the  results  on  range  search¬ 
ing  to  query  ranges  that  are  semi-algebraic  sets. 


16.5  Exercises 

16.1  Let  S  be  a  set  of  n  points  in  the  plane. 

a.  Suppose  that  the  points  in  S  lie  on  a  yjn  x  ^Jn  grid.  (Assume  for  sim¬ 
plicity  that  n  is  a  square.)  Let  r  be  a  parameter  with  1  ^  r  ^  n.  Draw 
a  fine  simplicial  partition  for  S  of  size  r  and  crossing  number  0(^/r). 

b.  Now  suppose  all  points  from  S  are  collinear.  Draw  a  fine  simplicial 
partition  for  S  of  size  r.  What  is  the  crossing  number  of  your  partition? 
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Chapter  16  16.2  Prove  that  the  selected  nodes  in  a  partition  tree  are  exactly  the  nodes  v 

simplex  range  searching  with  the  following  property:  t (v)  C  h  (or,  in  case  v  is  a  leaf,  the  point 

stored  at  v  lies  in  h )  and  there  is  no  ancestor  /i  of  v  such  that  t(jl')  C  h. 
Use  this  to  prove  that  S  ft  h  is  the  disjoint  union  of  the  canonical  subsets 
of  the  selected  nodes. 


16.3  Prove  that  the  recurrence  for  M(n)  given  in  the  proof  of  Lemma  16.2 
solves  to  M(n )  =  0(n). 

16.4  Prove  that  the  recurrence  for  Q(n)  given  in  the  proof  of  Lemma  16.3 
solves  to  Q(n)  =  0(nl!2+e). 

16.5  Suppose  we  have  a  partition  tree  as  defined  on  page  339,  except  that 
the  simplicial  partitions  used  in  the  construction  are  not  necessarily  fine. 
What  does  this  mean  for  the  amount  of  storage  used  by  the  partition 
tree?  And  for  its  query  time? 

16.6  Lemma  16.3  shows  that  for  any  e  >  0  we  can  build  a  partition  tree  with 
0(n]:2 1  £'j  query  time,  by  choosing  the  parameter  r  that  determines  the 
branching  degree  of  the  tree  to  be  a  large  enough  constant.  We  can  do 
even  better  if  we  choose  r  depending  on  n.  Show  that  the  query  time 
reduces  to  0(^n  log«)  if  we  choose  r  =  yfn.  (Note  that  the  value  of  r  is 
not  the  same  any  more  at  different  nodes  in  the  tree.  However,  this  is 
not  a  problem.) 

16.7  Prove  that  the  recurrence  for  M(n)  given  in  the  proof  of  Lemma  16.5 
solves  to  M(n )  =  O(n\ogn). 

16.8  Prove  that  the  recurrence  for  Q(n)  given  in  the  proof  of  Lemma  16.6 
solves  to  Q{n)  =  0{nl!2+e). 

16.9  Let  T  be  a  set  of  n  triangles  in  the  plane.  An  inverse  range  counting 
query  asks  to  count  the  number  of  triangles  from  T  containing  a  query 
point  q. 

a.  Design  a  data  structure  for  inverse  range  counting  queries  that  uses 
roughly  linear  storage  (for  example,  O(n\ogc n)  for  some  constant  c). 
Analyze  the  amount  of  storage  and  the  query  time  of  your  data  struc¬ 
ture. 

b.  Can  you  do  better  if  you  know  that  all  triangles  are  disjoint? 

16.10  Let  L  be  a  set  of  n  lines  in  the  plane. 

a.  Suppose  that  L  consists  of  \n/ 2J  vertical  lines  and  \n/ 2]  horizontal 
lines.  Let  r  be  a  parameter  with  1  ^  r  ^  n.  Draw  a  ( 1  / r)  -cutting  for  L 
of  size  0(r2). 

b.  Now  suppose  all  lines  from  L  are  vertical.  Draw  a  ( 1  / r)-cutting  for  L. 
What  is  the  size  of  your  cutting? 

16.11  Prove  that  the  recurrences  for  Q(n)  and  M(n)  given  in  the  proof  of 
Lemma  16.8  solve  to  Q(n)  =  O(\ogn)  and  M(n)  =  0(n2+e). 
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16.12  Prove  that  the  recurrences  for  Q(n)  and  M{ri)  given  in  the  proof  of 
Lemma  16.9  solve  to  Q(n )  =  0(log2n)  and  M{n )  =  0(n2+e). 

16.13  A  query  in  a  two-level  cutting  tree  visits  the  associated  structures  of  the 
nodes  on  one  path  in  the  tree.  The  query  time  in  an  associated  structure 
storing  m  lines  is  O(logm)  by  Lemma  16.8.  Because  the  depth  of  the 
main  tree  is  (9(log«),  the  total  query  time  is  0(log2n).  If  we  could 
choose  the  value  of  the  parameter  r  of  the  main  cutting  tree  larger  than 
constant,  for  example  n5  for  some  small  8  >  0,  then  the  depth  of  the 
main  tree  would  become  smaller,  resulting  in  a  reduction  of  the  query 
time.  Unfortunately,  there  is  an  0(r2)  term  in  the  recurrence  for  Q{n)  in 
the  proof  of  Lemma  16.9. 

a.  Describe  a  way  to  get  around  this  problem,  so  that  you  can  choose 
r  :=  ns . 

b.  Prove  that  the  query  time  of  your  two-level  data  structure  is  O(logn). 

c.  Prove  that  the  amount  of  storage  of  the  data  structure  is  still  0(n2+e). 

16.14  Design  a  data  structure  for  triangular  range  searching  that  has  <9(log3  n) 
query  time.  Describe  the  data  structure  as  well  as  the  query  algorithm 
precisely,  and  analyze  both  storage  and  query  time. 

16.15  Let  S  be  a  set  of  n  points  in  the  plane,  each  having  a  positive  real  weight 
associated  with  them.  Describe  two  data  structures  for  the  following 
query  problem:  find  the  point  in  a  query  half-plane  with  the  largest 
weight.  One  data  structure  should  use  linear  storage,  and  the  other  data 
structure  should  have  logarithmic  query  time.  Analyze  the  amount  of 
storage  and  the  query  time  of  both  data  structures. 

16.16  In  this  chapter  we  have  seen  a  data  structure  for  half-plane  range  search¬ 
ing  with  linear  storage  but  a  rather  high  query  time  (the  partition  tree) 
and  a  data  structure  with  logarithmic  query  time  but  a  rather  high  use  of 
storage  (the  cutting  tree).  Sometimes  one  would  like  to  have  something 
in  between:  a  data  structure  that  has  a  better  query  time  than  partition 
trees,  but  uses  less  storage  than  a  cutting  tree.  In  this  exercise  we  show 
how  to  design  such  a  structure. 

Suppose  that  we  have  0(ml+e )  storage  available,  for  some  m  between  n 
and  n2.  We  want  to  build  a  structure  for  selecting  points  in  half-planes 
that  uses  0(ml+e)  storage  and  has  as  fast  a  query  time  as  possible. 
The  idea  is  to  start  with  the  fastest  structure  we  have  (the  cutting  tree) 
and  switch  to  the  slow  structure  (the  partition  tree)  when  we  run  out 
of  storage.  That  is,  we  continue  the  construction  of  the  cutting  tree 
recursively  until  the  number  of  lines  we  have  to  store  drops  below  some 
threshold  n. 

a.  Describe  the  data  structure  and  the  query  algorithm  in  detail. 

b.  Compute  the  value  of  the  threshold  n  such  that  the  amount  of  storage 
is  0(m1+e). 

c.  Analyze  the  query  time  of  the  resulting  data  structure. 


Section  16.5 

EXERCISES 
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quadtree,  115,  307,  309 
range  tree,  105,  109 
segment  tree,  231,  233,  231 
database,  95,  116,  335 
database  query,  95 
Davenport-Schinzel  sequence,  188 
decomposable  searching  problem,  238 
decomposition 

trapezoidal,  124 
vertical,  124 
degeneracy,  5,  8,  137 
degenerate  case,  5,  8,  137 
degree  of  freedom,  284 
Delaunay  corner,  213 
Delaunay  graph,  1 96 
Delaunay  triangulation,  168,  197 
computation,  199 
pseudo  code,  200 

DelaunayTriangulation,  200 

density,  27 1 

depth  order,  260 

design  for  assembly,  12 

destination 

of  half-edge,  31 


diagonal 

of  polygon,  46 
difference 

of  polygons.  39 

Dijkstra’s  algorithm,  324,  326,  331 
direction 

representation  of,  65 
Dirichlet  tessellation,  167 
disc 

smallest  enclosing,  86 
discrepancy,  174,  175,  188 
computation,  175 
half-plane.  175 
discrete  measure,  175 
distance 

Euclidean.  148,  169,  332 
Lu  168 

L2.  148,  169,  332 
Lp,  168 
link,  332 
Manhattan,  168 
distance  function,  169 
domain 

of  mesh,  308 
of  terrain,  191 
double  wedge,  178 

doubly-connected  edge  list,  29,  30,  48, 
155,  180,  247 

dual 

of  line.  178 
of  object,  177 
of  point,  177 
of  segment.  178 
dual  graph 

of  triangulated  polygon,  47 
dual  plane,  178 
duality 

in  higher  dimensions.  186 
in  the  plane,  177 
dynamic  convex  hull,  13 
dynamic  point  location,  143 
dynamization,  237 

edge,  30 

illegal,  194 
of  poly  tope,  245 
edge  flip,  194 
edge  list 

doubly-connected,  29, 30, 48,  155, 
180,  247 

element 


of  mesh.  307  INDEX 

elementary  interval,  232 
ellipse 

smallest  enclosing,  90 
embedding 

of  graph,  30 
end  vertex,  50 
envelope 

lower,  253 
upper,  254 

Euclidean  distance.  148,  169,  332 
Euclidean  minimum  spanning  tree,  215, 

216 

Euler’s  formula,  28,  150.  245 
event,  22,  51,  151 
circle,  155,  161 
site,  153,  161 
event  point,  22,  51,  151 
event  queue,  24,  51,  155 
exact  arithmetic,  9 
exact  cell  decomposition,  303 
exact  match  query,  116 
expectation 

linearity  of,  78,  134,  205 
expected  performance,  78.  133 
expected  running  time,  78 

face,  30 

record  of,  3 1 
facet,  64.  245 

ordinary,  64 
top,  64 

false  alarm,  156 

farthest-point  Voronoi  diagram,  164,  169 
fat  subdivision 

point  location  in,  144 
fatness,  279 
feasible  point,  7 1 
feasible  region,  7 1 
feasible  solution,  71 
Fibonacci  heap,  331 
FindIntersections,  25 
FindNewEvent,  27 
FindSplitNode,  97 
finite  element  method,  307 
first-level  tree,  106,  344 
flap,  251 
flip 

of  edge,  194 

floating  point  arithmetic,  5  - 

FollowSegment,  130  379 


INDEX  forbidden  configuration  space,  285,  330 
ForbiddenSpace,  298 
Fortune’s  algorithm,  151 
fractional  cascading,  109,  112,  143,  221 
free  configuration  space,  285,  324,  330 
free  path,  305 
free  space,  285,  324,  330 
representation  of,  287 
trapezoidal  map  of,  287,  324 
free  split,  265 

Gabriel  graph.  215,  217 
general  position,  9,  124 
GenerateMesh,  316 
genus,  245 

geographic  information  systems,  1,11, 
15 

geometric  graph,  215 
geometric  modeling,  15 
GIS,  11 

Graham’s  scan,  13 
graph 

Gabriel,  215,  217 
geometric,  215 

relative  neighborhood,  215,  217 
visibility,  323 
grid,  116 
guard 

for  low-density  scene,  272 

half-edge,  31 

destination  of,  31 
origin  of,  3 1 
record  of,  32 

half-plane  discrepancy,  175 
half-planes 

common  intersection  of,  66,  89 
HandleCircleEvent,  158 
HandleEndVertex,  53 
HandleEventPoint,  26 
HandleMerge  Vertex,  54 
HandleRegularVertex,  54 
HandleSiteEvent,  158 
HandleSplitVertex,  53 
Handles  tart  Vertex,  53 
harmonic  number,  135 
heap,  227 

Helly-type  theorem,  90 
hidden  surface  removal,  259 
higher-dimensional  linear  programming, 
82 


higher-order  Voronoi  diagram,  169 

horizon,  247 

hull 

convex,  2,  89,  193,  243 
lower,  6,  254 
upper,  6,  253 

illegal  edge,  194 
implicit  point  location,  144 
incidence  preserving,  178 
incident,  30 

infeasible  linear  program,  7 1 
infeasible  point,  7 1 
inner  vertex,  325 
InsertSegmentTree,  234 
interpolation 

data-independent,  214 
linear,  191 

IntersectHalfplanes,  67 
intersection 

of  half-planes,  66,  89 
of  line  segments,  19 
of  polygons,  39 

intersection-sensitive  algorithm,  21 
interval 

elementary,  232 
interval  tree,  220,  222,  237 
inversion,  186 
isolated  vertex,  3 1 
iterated  logarithm,  60 

jaggies,  174 
Jarvis’s  march,  13 
joint 

of  robot,  283 
prismatic,  283 
revolute,  283 

k-level 

in  arrangement,  187 
k-  set,  187 
kd-tree,  100,  116 

L\ -metric,  168 
L2-metric,  148,  169,  332 
Lp -metric,  168 
layer,  19,  335 
layered  range  tree,  113 
legal  triangulation,  195 
LegalizeEdge,  201 
LegalTriangulation,  195 
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level 
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in  arrangement,  185 
in  data  structure,  106,  343 
line  segment 

axis-parallel,  220 
line  segment  intersection,  19,  40 
computation,  20 
counting,  41 
lower  bound,  40 
pseudo  code,  25 
red-blue,  40 

linear  constraint,  65,  66,  71 
linear  interpolation,  191 
linear  optimization,  7 1 
linear  program,  7 1 
bounded,  73 
dimension  of,  7 1 
infeasible,  71 
unbounded,  73,  79 
linear  programming,  7 1 
1 -dimensional,  75 
higher-dimensional,  82 
low-dimensional,  72 
pseudo  code,  75,  77,  84 
linearity  of  expectation,  78,  134,  205 
link  metric,  332 
locally  shortest  path,  325 
locus  approach,  231 
logarithm 

iterated,  60 

LowDensityBSP2d,  276 
lower  bound 

convex  hull,  13 
line  segment  intersection,  40 
of  auto-partition,  269 
of  binary  space  partition,  270 
range  tree,  116 
simplex  range  searching,  352 
triangulation,  59 
Voronoi  diagram,  170 
lower  envelope,  253 
lower  hull,  6,  254 
lower  vertical  extension,  124 
LP-type  problem,  90 

MakeMonotone,  53 
Manhattan  metric,  168 
manufacturing,  12,  15,  63,  89 
map 

point  location  on,  122 
trapezoidal,  122,  124,  287 


map  layer,  79,  335 
map  overlay,  1,  20 
computation,  33 
pseudo  code,  38 
MapOverlay,  38 
Markov’s  inequality,  141 
maximal  planar  subdivision,  193 
measure 

continuous,  175 
discrete,  175 
medial  axis,  169 
merge  vertex,  50 
mesh,  307 

conforming,  308,  319 
consistent,  319 
domain  of,  308 
non-uniform,  308 
requirements,  308 
respecting  the  input,  308 
structured,  319 
triangular,  308 
uniform,  309 
unstructured,  319 
mesh  element,  307 
mesh  generation,  307 
pseudo  code,  316 
metric 

combined.  332 
Euclidean,  148,  169,  332 
Lu  168 

L2,  148,  169,  332 
Lp,  168 
link,  332 
Manhattan,  168 
MlNlDlSC,  86 

MiniDiscWith2Points,  87 
MiniDiscWithPoint,  87 
minimum  spanning  tree 
Euclidean,  215,  216 
minimum  weight  triangulation,  215,218 
Minkowski  difference,  292 
Minkowski  sum,  291,  305,  330 
complexity,  297 
computation,  295 
of  non-convex  polygons,  296 
pseudo  code,  295 
MinkowskiSum,  295 
mixture,  243 
mobile  robot,  283,  323 
molecular  modeling,  12 
monotone  polygon,  49 
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Index  strictly,  55 

triangulation  of,  55 
motion  planning,  1,  14,  162,  283,  323 
exact  cell  decomposition,  303 
retraction  method,  304 
road  map  for,  288 
mouse 

clicking  with,  5,  122 
multi-level  data  structure,  106,  238,  343 

neighbor  finding 

in  quadtree,  312 
non-crossing,  124 
non-linear  optimization,  86,  90 
non-uniform  mesh,  308 
NorthNeighbor,  313 

objective  function,  7 1 
obstacle,  283,  324 

configuration-space,  285 
obtuse  angle,  319 
octree,  318 

operations  research,  72 
optimal  vertex,  73 
optimization 

combinatorial,  90 
linear,  7 1 
non-linear,  86,  90 
order  preserving,  178 
order-fc  Voronoi  diagram,  169,  188 
ordinary  facet,  64 
origin 

of  half-edge,  3 1 
orthogonal  line  segment,  220 
orthogonal  range  query,  96 
output-sensitive  algorithm,  21,99 
overlap 

cyclic,  260 

overlay,  1,  20,  33,  167 
computation,  33 
pseudo  code,  38 

painter’s  algorithm,  260 
PaintersAlgorithm,  263 
paraboloid,  178 
parameter  space,  232 
ParanoidMaximum,  93 
partial  match  query,  116 
partition 

-  auto,  262 
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simplicial,  337 
partition  tree,  336,  338 
path 

free,  305 

locally  shortest,  325 
semi-free,  305 
persistency,  143 
perturbation 

symbolic,  9,  14 
PHASEl,  275 
pixel,  173,  259 
placement  of  robot,  284 
planar  point  location,  122 
planar  subdivision,  30,  121 
maximal,  193 

plane  sweep,  22,  34,  51,  68,  151,  328 
rotational,  328 
pocket,  61 
point 

infeasible,  7 1 
Steiner,  309 
point  location,  121 
dynamic,  143 
higher  dimensional,  143 
implicit,  144 
planar,  121 
query,  128 

point  location  query,  121 
point  robot,  286,  324 
polygon 

monotone,  49 
rectilinear,  60 
simple,  46 
star-shaped,  93,  145 
strictly  monotone,  55 
triangulation  of,  45 
y-monotone,  49 
polyhedral  terrain,  192 
polyhedron,  64 
polytope,  244 

simplicial,  245 
tetrahedralization  of,  60 
potential  field  method,  305 
power  diagram,  169 
preprocessing,  122 
primal  plane,  178 
primitive  operation,  4 
printed  circuit  board,  307 
priority  search  tree,  226,  237 
prismatic  joint,  283 
pseudodisc,  292 


pseudodisc  property,  293 
pseudodiscs 

complexity  of  union,  294 

quadtree,  115,273,307,  309 
balanced,  313 
compressed,  318 
higher-dimensional,  318 
neighbor  finding,  312 
quadtree  subdivision,  310 
query 

database,  95 
exact  match,  116 
partial  match,  116 
point  location,  121,  128 
range,  96 

simplex  range,  352 
stabbing,  237 
stabbing  counting,  237 
triangular  range,  336 
windowing,  219,  335 
QUERYlNTERVALTREE,  224 
QueryPrioSearchTree,  230 
QuerySegmentTree,  234 

random  number  generator,  77 
random  permutation 
computation,  77 

randomized  algorithm,  75,  77,  78,  84, 
128,  199,  208,  246,  264 
analysis,  76,  78,  88,  133,  205, 
210,  250,  265 
RANDOMIZEDLP,  84 
RandomPermutation,  77 
range  query,  220,  318 

1 - dimensional,  96 

2- dimensional,  99 
counting,  119 
orthogonal,  96 
rectangular.  96 
simplex,  352 
triangular,  336 

range  searching 

approximate,  353 
range  tree,  99,  105,  109,  220 

1 - dimensional,  99 

2- dimensional,  106 
rf-dimensional,  109 
layered,  113 
lower  bound,  116 

ray  shooting,  146.  332 


ray  tracing,  173 
rectangular  range  query,  96 
rectangular  subdivision 

point  location  in,  144 
rectilinear  polygon,  60 
red-blue  line  segment  intersection,  40 
redundant  half-plane,  92 
reference  point,  284 
region 

feasible,  7 1 
regular  vertex,  50 

relative  neighborhood  graph,  215,  217 

rendering,  173,  259 

ReportInSubtree,  229 

Retraction,  162 

retraction,  162,  163 

retraction  method,  304 

revolute  joint,  283 

road  map,  288,  324 

robot 

articulated,  283 
autonomous,  283 
car-like,  284 
configuration  of,  284 
configuration  space  of,  285 
degree  of  freedom,  284 
mobile,  283 

motion  planning  for,  283 
open  or  closed,  286 
parameter  space  of,  285 
placement  of,  284 
point,  286,  324 
reference  point  of,  284 
road  map  for,  288 
work  space  of,  284 
robot  arm,  86,  283 
robotics,  1,  10,  14,  283 
robustness,  5,  8,  9 
rotational  plane  sweep,  328 
rounding  error,  5 
roundness,  163 
running  time 

average,  78 
expected,  78 

sample  points,  174 
scan-conversion,  259 
search  structure 

for  point  location,  128 
search  tree 

binary,  96 
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Index  SearchKdTree,  103 

second-level  tree,  106,  344 

see,  325 

segment 

axis-parallel,  220 
segment  tree,  231,  233,  237 
SelectBelowPair,  350 
SelectBelowPoint,  348 
SelectInHalfplane,  339 
SelectIntSegments,  344 
semi-free  path,  305 
shear  transformation,  137 
shortest  path,  323 

for  polygonal  robot,  330 
in  graph,  331 
ShortestPath,  326 
side 

in  trapezoidal  map,  125 
simple  arrangement,  1 80 
simple  polygon,  46 
simplex  algorithm,  72,  90 
simplex  range  query,  352 
lower  bound,  352 
simplicial  partition,  337 
simplicial  polytope,  245 
single  cell,  304 
site,  147 

line  segment,  160 
site  event,  153,  161 
skeleton 

of  polygon,  169 
slab,  122,  235 
SlowConvexHull,  3 
smallest  enclosing  ball,  90 
smallest  enclosing  disc,  86 
smallest  enclosing  ellipse,  90 
smallest- width  annulus,  163 
solution 

feasible,  7 1 

split 

free,  265 
split  vertex,  50 
stabber,  189.  190 
stabbing  counting  query,  237 
stabbing  number 
of  polygon,  61 
stabbing  query,  237 
star-shaped  polygon,  93,  145 
start  vertex,  50 
status 


of  sweep  line,  22,  25,  52,  155, 
328 

status  structure,  25,  52,  155,  328 
Steiner  point,  214,  309 
Steiner  triangulation,  309 
strictly  monotone  polygon,  55 
structural  change,  205,  210,  250 
subdivision,  30,  121 
complexity  of,  30 
connected,  30 
maximal  planar,  193 
quadtree,  310 
representation  of.  29 
subdivision  overlay,  33 
sum 

Minkowski,  291 
of  two  points,  291 
vector,  291 
supersampling,  174 
sweep  algorithm,  22,  34,  51,  68,  151, 
328 

rotational,  328 
sweep  line,  22,  51,  151,  160 
symbolic  perturbation,  9,  14 

tail  estimate,  140 
terrain,  191 

domain  of,  191 
polyhedral,  192 
tetrahedralization 
of  polytope,  60 
Thales’  Theorem,  194 
thematic  map  layer,  19,  335 
thematic  map  overlay,  20 
top  facet,  64 
trading  area,  147 
transform 

duality,  177 
inversion,  186 
shear,  137 
translating  polygon 

conbguration  space  of,  291 
configuration-space  obstacle  for, 
291 

transversal,  189,  190 
trapezoidal  decomposition,  124 
trapezoidal  map,  122,  124,  287,  324 
complexity  of.  127 
computation,  128 
of  the  free  space,  287,  324 
pseudo  code,  129 


TrapezoidalMap,  129 
traveling  salesman  problem,  216 
tree 

1 -dimensional  range,  99 

binary  search,  96 

binary  space  partition,  261 

BSP,  261 

cutting,  346 

heap,  227 

interval,  220,  222,  237 
kd-tree,  100 
octree,  318 
partition,  336,  338 
priority  search,  226,  237 
quadtree,  115,  307,  309 
range,  105,  109 
segment,  231,  233 ,  237 
triangle 

well-shaped,  308 
triangular  mesh,  308 
triangular  range  query,  336 
triangulated  polygon 
3-coloring,  47,  59 
dual  graph,  47 

TriangulateMonotonePolygon, 

57 

triangulation 

computation,  49,  55 
Delaunay,  1 97 
legal,  195 
lower  bound,  59 
minimum  weight,  215,  218 
of  monotone  polygon,  55 
of  point  set,  192, 193 
of  polygon,  45,  46 
pseudo  code,  53,  57 
Steiner,  309 

triangulation  refinement 
point  location  by,  143 
turn  vertex,  49 
twin 

of  half-edge,  3 1 

unbounded  linear  program,  73.  79 

uniform  mesh,  309 

union 

complexity  of,  304 
of  polygons.  39 
Upper  Bound  Theorem.  256 
upper  envelope,  254 
upper  hull,  6,  253 


upper  vertical  extension,  124  INDEX 

vector  sum,  291 
vertex,  30 

inner,  325 
isolated,  31 
optimal.  73 
record  of,  31 

vertical  decomposition,  124 
vertical  extension,  124 
lower,  124 
upper,  124 
viewing  volume,  220 
visibility  complex,  332 
visibility  edge,  325 
visibility  graph,  323 
computation,  326 
pseudo  code,  326 
visibility— Voronoi  complex,  332 
VisibilityGraph,  326 
Visible,  329 
visible,  48,  259,  325 
visible  region,  61,  247 
VisibleVertices,  328 
Voronoi  assignment  model,  148 
Voronoi  cell,  149 
Voronoi  diagram,  1,  147,  148,  254 
abstract,  169 
complexity  of,  149,  168 
computation,  151 
degenerate  cases,  159 
farthest-point,  164,  169 
higher-order,  169 
lower  bound,  170 
medial  axis,  169 
of  line  segments,  160 
order-k,  169,  188 
power  diagram,  169 
pseudo  code,  157 
skeleton,  169 
weighted,  169 
VoronoiDiagram,  157 

weighted  Voronoi  diagram,  169 
well-shaped  triangle,  308 
window,  219,  335 
window  system,  122 
windowing  query,  219,  335 
work  space,  284,  324 


y-monotone  polygon.  49  385 


INDEX  z-buffer  algorithm,  259 
zone 

of  hyperplane,  187 
of  line,  183,  269 
Zone  Theorem,  184,  187 
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